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DNA SEQUENCING BY MASS SPECTROMETRY 

Background nf the Invention 

Since the genetic information is represented by the seauence of the four 
DNA building blocks deoxyadenosine- (dpA). deoxyguanosine- (dpG). deoxvcvudine- 
(dpC) and deoxvthymidmeo'-phosphate (dpT), DNA sequencing is one of the most 
fundamental technologies in molecular biology and the life sciences in general The ease 
and the rate by which DNA sequences can be obtained greatly affects related technologies 
such as development and production of new therapeutic agents and new and useful 
varieties of plants and microorganisms via recombinant DNA technology. In particular 
unraveling the DNA sequence helps in understanding human pathological conditions 
including genetic disorders, cancer and AIDS. In some cases, verv subtle differences such 
as a one nucleotide deletion, addition or substitution can create serious, in some cases even 
fatal, consequences. Recently. DNA sequencing has become the core technology of the 
Human Genome Sequencing Project (e.g.. J.E. Bishop and M. Waldholz. 1991 fisnomE: 
The Storv of the Most Asropj^p Scien tific Adventur e of Q.,r tw . tu. Tn 
Map All the Genes in the Humnp Horry , Simon & Schuster. New York) Knowledge of 
the complete human genome DNA sequence will certainly help to understand, to diagnose 
to prevent and to treat human diseases. To be able to tackle successfully the determination 
of the approximately 3 billion base pairs of the human genome m a reasonable time frame 
and m an economical way, rapid, reliable, sensitive and inexpensive methods need to be 
developed, which also offer the possibility of automation. The present invention provides 
such a technology. 

Recent reviews of today's methods together with future directions and trends 
are given by Barrell ( The FASFP Journal 5_, 40-45 (1991 )). and Trainor ( .Anal. Chem 
418-26 (1990)). 

Currently. DNA sequencing is performed by either the chemical degradation 
method of Maxarn and Gilbert (Method, in Fn^. 4 99-560 (1980)) or A- 
enzymatic dideoxynucleotide termination method of Sanger et al ( Proc. N„tl a.,h <:„,• 
ISA 74, 5463-67 (1977)). In the chem.cal method, base specific modifications result in a 
base specific cleavage of the radioactive or fluorescently labeled DNA fragment With the 
four separate base specific cleavage reactions, four sets of nested fragments are produced 
which are separated according to length by polyacrylam.de gel electrophoresis (PAGE) 
After autoradiography, the seauence car He - 



In the enzymatic chain termination method, the four base specific set< o r 
DNA fragments are formed by staning with a primer/template system elongating the ' 
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£ «/, DNA polvmerase I. a DNA polvmerase from TH crmus aguallcm Ta0 D =™ 
po._ or a mod.fied T7 DNA poiymerase. Sequent (Tabor „ ./ . 
**ff±* «*--477, W». .n me presence of cha,n,ermma,~,s 
react'o Cha ' n -' Cmiina,inS ™" * " * "-8 , nt0 me four "separate ' 

dGTP, dTTP and dCTP. only one of ,he chain-terming dideox^ucleostde 
tnphospha.es. ddATP. ddGTP. ddTTP „r ddCTP. respecvelv, in \ llmitin „ small 
co„c_. ^ - of W fragments produce, after electropC fo ur 
0 base specific ladders from which the DNA sequence can be determined 

A recent modification of tne Sanger sequencing strategy involves the 

f — ^""^ ™A *™ obtatJd by usine a t h , th , 0 
1Mf ,nstead of the normal! v used ddNTPs durino the nn mo • " 

h„ r>MA . ' ""^ * rs aunng tne pnmer extension reaction mediated 
by DNA polymerase (Labeit ei al. , DN \ 5 173-177 riQRfiV a u UOn mediated 

; r,R»^, n .r.,... . . ^ 1 /7 (1 986 -»' ^"^am. PCT-Apolication 

sequencmg 

snake venom phosphodiesterase, subsequent separation on PAGE and v.suZ " b 
rad,o,so,op,c labeling of either the primer or one of the dNTPs in a fS^Z r 
*■ b r specific ceavage is achieved by a ,ky Iau „g the sulph^ ^^^T 
p spho tester bond followed bv a hea, batmen. (Max-Planck-Geseilscha OS 
A ). Both methods cat, be combined with the amplification of the DNA via he 
Polymerase Chain Reaction (PCR). 

On the upfront end, me DNA to be sequenced has to be fomented into 
ue b e p,e C es of current,. n0 , more matt 5 00 to ,000 nucleotides. Star," a 
TO .h,s ,s a multt-step process involving clomng and subcloning steps ustn^en, 
and approbate clora „g vectors such as YAC. cosmids, plasmids and M13 vecjs 

fr " ^^^^^te^ Cold Spnng hIoT 
Ubo,,o Press . , 989 , finaIIj . for Sanger ^^^^ ^ ^ S Harb r 

000 base pa.rs are mtegrated mto a specific restnction site of the rep.icative form I n „ 

hen TTZ baCKn ° Phage ^ Md a-ii =59 

a template for the Sanger sequencmg process having a binding s.te for a umversal 
pnmer o tamed by chem.ca, DNA synthesis (Sum, Biemat. McManus and Lter 

U. ,539-57 (,9S4, : U.S. Patent No. 4725677 upstrel «Z 
rcstna.cn s„e mto which the unknown DNA fraemen, K.„ „1. ! 



rh ■ ' ^ " ^ne^LJ^i. 32-39 (1988.. and. with th^ vrne . as . 

Cnam Reaction fPCR) (PfR p^t^i. * ■ . ronme.ase 
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editors. Academic Press. San Diego (1990)) cloning or subcloning steps could be omitted 
by directly sequencing off chromosomal DNA by first amplifying the DNA socmen: bv 
PCR and then applying the Sanger sequencing method (Innis et al. P r oc Natl. Acad Sci. 
USA Si, 9436-94-10 ( 1988)). In this case, however, the DNA sequence in the interested 
5 region most be known at least to the extent to bind a sequencing primer. 

In order to be able to read the sequence from PAGE, detectable labels have 
to be used in either the primer (very often at the 5-end) or in one of the deoxynucleoside 
triphosphates. dNTP. Using radioisotopes such as ^-P. ^3p or 3>g j s sl jjj ^ most 
frequently used technique. After PAGE, the gels are exposed to X-ray films and silver 
10 grain exposure is analyzed. The use of radioisotopic labeling creates several problems. 

Most labels useful for autoradiographic detection of sequencing fragements have relativeh 
short half-lives which can limit the useful time of the labels. The emission hish enercv 
beta radiation, particularly from ~~P. can lead to breakdown of the products via radiohsis 
so that the sample should be used very quickly after labeling. In addition, high enersv 

15 radiation can also cause a deterioration of band sharpness by scattering. Some of these 
problems can be reduced by using the less energetic isotopes such as 33p or 35$ ( see> e g 
Ornstein et al, Biotechniques 2. 476 (1985)). Here, however, longer exposure times have 
to be tolerated. Above all. the use of radioisotopes poses significant health risks to the 
experimentalist and. in heavy sequencing projects, decontamination and handling the 

20 radioactive waste are other severe problems and burdens. 

In response to the above mentioned problems related to the use of radioactive 
labels, non-radioactive labeling techniques have been explored and. in recent vears, 
integrated into partly automated DNA sequencing procedures. All these improvements 
utilize the Sanger sequencing strategy. The fluorescent label can be tagged to the primer 

25 (Smith et al. Nature 321 . 674-679 (1986) and EPO Patent No. 87300998.9; Du Pont De 
Nemours EPO Application No. 0359225; Ansorge et al J. Biochem. Biophvs. Methods 
U, 325-32 (1986) ) or to the chain-terminating dideoxynucloside triphosphates (Prober et 
al Science 218. 336-41 (1987); Applied Biosystems, PCT Application WO 91/05060 ). 
Based on either label ing the primer or the ddNTP. svstems have been developed by 

30 Applied Biosystems (Smith et al. Science 235 . G89 (1987); U.S. Patent Nos. 570973 and 
689013), Du Pont De Nemours (Prober et al Science 238 . 336-341 (1987); U.S. Patents 
Nos. 881372 and 57566). Pharmacia-LKB (Ansorge et al Nucleic Acids Res. I 5. 4593- 
4602 (1987) and EMBL Patent Application DE P3724442 and P3805808.1 ) and Hitachi 
(JP 1-90844 and DE 401 1991 Al). A somewhat similar approach was developed by 

35 Brumbaugh er a! ('Proc Natl t . tq;a 1 1 : — - - 

A difierent approach uses fiuorescently labeled avidin and biotm labeled primers. Here, 
the sequencing ladders ending with biotin are reacted during electrophoresis with the 
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labeled avidin which results in the detection of the individual sequencing bands 
(Brumbaugh et aL U.S. Patent No. 594676). 

More recently even more sensitive non-radioactive labeling techniques for 
DNA using chemiluminescence tnggerable ard amplifyable by enzymes have been 
5 developed (Beck. O'Keefe. Coull and Koster. Nucleic Acids Res. 17, 5 1 1 5-5 1 23 ( 1 989) 
and Beck and Koster, Anal. Chenv £2- 2258-2270 (1990)). These labeling methods were 
combined with multiplex DNA sequencing (Church et al Science 240, 185-188 (1 988 ) to 
provide for a strategy aimed at high throughput DNA sequencing (Koster et ai % 
Nucleic Acids Res. Symposium Ser. No 74 ^IS-^PI (1991), University of Utah, PCT 
10 Application No. WO 90/15883): this strategy still suffers from the disadvantage of being 
very laborious and difficult to automate. 

In an attempt to simplify* DNA sequencing, solid supports have been 
introduced. In most cases published so far. the template strand for sequencing (with or 
without PCR amplification) is immobilized on a solid support most frequently utilizing the 
15 strong biotm-avidin/strcptavidm interaction (Orion- Yhtyma Oy, U.S. Patent No. 277643; 
M. Uhlen et al Nucleic Acids Res. 1£, 3025-38 (1988); Cemu Bioteknik. PCT 
Application No. WO 89/09282 and Medical Research Council, GB, PCT Application No. 
WO 92/03575). The primer extension products synthesized on the immobilized template 
strand are purified of enzymes, other sequencing reagents and by-products by a washing 
20 step and then released under denaturing conditions by loosing the hydrogen bonds 

between the Watson-Crick base pairs and subjected to PAGE separation. In a different 
approach, the primer extension products (not the template) from a DNA sequencing 
reaction are bound to a solid support via biotin/avidin (Du Pont De Nemours, PCT 
Application WO 91/1 1 533). In contrast to the above mentioned methods, here, the 
25 interaction between biotin and avidin is overcome by employing denaturing conditions 
(formamide/EDTA) to release the primer extension products of the sequencing reaction 
from the solid support for PAGE separation. As solid supports, beads, (e.g., magnetic 
beads (Dynabeads) and Sepharose beads), filters, capillaries, plastic dipsticks (e.g., 
polystyrene snips) and microtiter wells are being proposed. 
30 All methods discussed so far have one central step in common: 

polyacrylamide gel electrophoresis (PAGE). In many instances, this represents a major 
drawback and limitation for each of these methods. Preparing a homogeneous gel by 
polymerization, loading of the samples, the electrophoresis itself, detection of the 
sequence pattern (e.g., by autoradiography), removing the gel and cleaning the glass plates 
35 to prepare another gel are very laborious and time-consuming procedures Mnreov- tK, 



- L ' ° : rjujua.u ^- idOLMnu. auto:aaiograpn\ useli can consume from hours ro davs. In 
the case of fluorescent labeling, at ieast the detection of the sequencing bands is being 
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performed automatically when using the laser-scanning devices integrated into 
commercial available DNA sequencers. One problem related to the fluorescent labelmc i 
the influence of the four different base-specific fluorescent tags on the mobility of the ~ 
fragments during electrophoresis and a possible overlap in the spectral bandwidth of the 
four specific dyes reducing the discriminating power between neighboring bands, hence, 
increasing the probability of sequence ambiguities. .Artifacts are also produced by base- 
specific interactions with the polyacrylamide gel matrix (Frank and Koster. Nucleic 
*tids ^ 6- 2069 H979)) and by the formation of secondary structures which result in 
"band compressions" and hence do not allow one to read the sequence. This problem has. 
in part, been overcome by using 7-deazadeoxyguanosine triphosphates (Barr et a!., 
BiQtechniques 4. 428 (1986)). However, the reasons for some artifacts and conspicuous 
bands are still under investigation and need further improvement of the gel electrophoretic 
procedure. 

A recent innovation in electrophoresis is capillarv zone elecrmnhnr-pJr 
(CZE) (Jorgenson et a!.. J Chromatography 3_5_2. 337 (1986): Gesteland et ai, 
Nucleic Acids Prs 1&. 1415-1439 (1990)) which, compared to slab gel electrophoresis 
(PAGE), significantly increases the resolution of the separation, reduces the time for an 
electrophoretic run and allows the analysis of very small samples. Here, however, other 
problems arise due to the miniaturization of the whole system such as wall effects and the 
necessity of highly sensitive on-line detection methods. Compared to PAGE, another 
drawback is created by the fact that CZE is only a "one-lane" process, whereas in PAGE 
samples in multiple lanes can be electrophoresed simultaneously. 

Due to the severe limitations and problems related to having PAGE as an 
integral and central part in the standard DNA sequencing protocol, several methods have 
been proposed to do DNA sequencing without an electrophoretic step. One approach calls 
for hybridization or fragmentation sequencing (Bains, Biotechnology 1Q_, 757-58 (1992) 
and Mirzabekov et al, FEBS Letters 256, 1 1 8-122 (1989)) utilizing the specific 
hybridization of known short oligonucleotides (e.g.. octadeoxynucleotides which gives 
65,536 different sequences) to a complementary DNA sequence. Positive hybridization 
30 reveals a short stretch of the unknown sequence. Repeating this process by performing 
hybridizations with all possible octadeoxynucleotides should theoretically determine the 
sequence. In a completely different approach, rapid sequencing of DNA is done bv 
unilaterally degrading one single, immobilized DNA fragment by an exonuclease in a 
moving flow stream and detecting the cleaved nucleotides by their specific fluorescent tas 
35 via laser excitation (Jert et a' J BiTnol?~»V ^— - r r 
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generated when tine correct nucleotide is attached to the growing chain on a primer- 
template system is used to determine the DNA sequence. The enzymes used 2nd the n\ • 
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are held ,„ place b ? solid phases (DEAE-Sephatose and Sepharose, either b> ,ornc 
n,erac,,ons or by covalen, a,achme„t. ]n . C0M , nu0 „ s fl ^ ^ °™ 

^Pyrophosphate is defined v,a bioluminescence flucW, A sv^ 
DNA se^ncmg ,s a,so used by Tsien „ ai (PC T Apphcation No. WO 9, W 7« Her 
5 *e .ncomt„g dNTPs are protected a, the 3'-end bv vanous blocking er0 ups such L ace" ' 
or phosphate groups and are removed before the nexI clon?alio „ £ h 
process very slow compared ,o standard sequencing memods. The . mplate DNA is 
mmob.hzed on a poller support. To detect incorporation, a fluoresce', or^o ,,ve 
label ,s additionally incorporated in.o the modified dNTP'q Th radioacn e 

.0 also describes an apparatus designed ,o MoJTZL ^ 

Mass spectrometry, in general, provides a means of "wenzhintt" individual 
mo ecu =s byiontztng the molecules *, vaco attd making mem "flV bv vJL^T 
Under ,he influence of combir.a.ions of elecrric and magnetic fie.ds'. the ,ons o low 
Rectories depending on the.r individual mass ,m> «, charge (z,. ,„ me ran 7 

5 molecules with Invi- mni»^,u tcul 

t . . VVC1 ^ 1L - ma5S spectrometry has long been pan of the m ,m„. 

physical-organic repenoire for analysis and charac.enza.ion of organic m o^u et b T 
e,_o„ of .he mass of me paren, molecular ion. ,„ addition", bv arr^,„ g o ilns 

ZZZ °** " (e * -»X *e mo,;cu ar t s 

Sri " 8 SCCOndar> ' b> ' S0 " Ca " ed ««*« "issocianon (C D) 

Th fragment,™ pathway very often allows me deriva.ion of defiled Ic^ 
nformaaon Marty appltcanons of ma. spec.rome.ric memods in me !cno J „~ 

VOU93. Mass Spectrome,r> " (J.A. McCloskey. edi.or), I 990^ Academic P^sX 

Due to the apparent analytical advantages of mass snecr-mm^, • 

byC D tn conjunction with an MS/MS configuration and speed, as well as on-lta Ta» 
ttansfer ,o a compu,er. mere has been considerable interest m me use of mass s ecrlmetrv 
for .he structural analvsis of nucleic acids R„.„, ot mass spectrometry 

i' tj r i • °' nucle,c ac '<ls- Reeen. reviews summarizing mis field include 

K- H. Schram, Mass Spectrometry of Nucleic Acid Components, Biomedical 
APP canons of Mass Spectrometry" a. ( , 990); and P.P. Cra.n "m! 

> 4 (1990). The b.gges. hurdle ,o applymg mass spirometry to nude,c acids s me' 
difficulty of volatiltzmg these verv polar biopolvmers Th„,f„ •■ 
Hmaed .o low moleculat we, gh , syndic .£^£^2^" ^ *" 



* ' " ~- ;vi - '••onnguration uinizinc. in narucuiar for the 
—..on ^ volatilization t„= method of fast atom.c bomba,ament (F.A3 mass 
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spectrometry, or plasma desorpuon ,PD mass spectrometry). As an example the 
application of FAB to the analysis of protected dimeric blocks for chemical svnthes-s of 
ol.godeoxynucleoudes has been described (Koster et al. Bio^, C ^^ T My . 
Spectrometry M. 1 11-1 16(1987)). 

» Two more recent 'oruzatioa/desorption techniques are electrosprav/ionsprav 

(ES) and matnx-assisted laser desorption/ionization (MALDI). ES mass spectrometry has 

WoZTZ'Z Feim €l aL (U *^ £i **- * 445 ] C 984): PCI Application No. 
WO 90/14148) and current applications are summarized in recent review articles (R D 
Smuh et al, Anal C hen, <£. 882-89 (1 990) and B. Ardrev. Electrosprav Mass 
Spectrometry, Spectroscopy Fnm pr. 4, 10-18 (1992)). The molecular weiehts of the 
tetradecanucleotide d(CATGCCATGGCATG) (SEQ ID NO: 1 ) (Covev et al "The 
Determination of Protein. Oligonucleotide and Peptide Molecular Weights bv lonsprav 
Mass Spectrometry." Rapid Communications in M,«« ->~ -, 49 ^ 6 ' 

°f*e-l-merd(A.^TTGTGCACATCCTr,rArwrwccnmx^^. , .. 
, -yv-^v i>u.^j ana witnout civinc 

details of that of a tRNA with 76 nucleotides (Method, ,n Pn^. m ^ ' 

Spectromet^" (McCloskey. editor), p. 425. 1990, Academic Press. New York) have been 
published As a mass analyzer, a quadruple is most frequently used. The detennination 
of mo ecular weights m femtomole amounts of sample is very accurate due to the presence 
of multiple ion peaks which all could be used for the mass calculation. 

MALDI mass spectrometry, in contrast, can be particularly attractive when a 
time-of-fhght (TOP) configuration is used as a mass analyzer. The MALDI-TOF mass 
spectrometry has been introduced by Hillenkamp et al. ("Matrix Assisted UV-Laser 

ZTT^T n ' A APPr ° aCh " MaSS SpeCtr0m ^' ° f L ^ e B-molecule," 
Bm lPgieal Ma ss Sp.rtro m.tn: (Burlingame and McCloskey, editors). Elsevier Science 

Publishers, Amsterdam, pp. 49-60, 1990.) Since, in most case, no multiple molecular ion 

peaks are produced with this technique, the mass spectra, m principle, look simpler 

E u mass ™ etiy ' A,though dna m ° iecuies ^ » * **** 

of 4 o 000 daltons could be desorbed and volatilized (Williams et al. "Volatilization of 
High Molecular Weight DNA by Pulsed Laser Ab.ation of Frozen Aqueous Solutions " 
SaaiBfi. 1585-87 (,989),. this technique has so far only been used to determine the 
molecular weights of relatively small oligonucleotides of known sequence e g 
ohgothymidylic acids up to 1 8 nucleotides (Huth-Fehre et al, -Matrix-Assisted Laser 
Desorpuon Mass Spectrometry of Oligodeoxythvmidvlic Acids " 

K ^omrnUTlicm .o n s i n N jaS s Spectr ometr y , i 20 9-13 0992^ and a double-stranded 



MWwi m one publication (Hum- 



Fehre et a!.. 1902 . supra, n was shown that a mixture of all me olicothvmidviic acids 
trom n=12 to n=l 8 nucleotides could he -esolved * ' 
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In U.S. Paient No. 5.064.754. RNA transcnpts extended by DNA both of 
which are complementary to the DN'A to be sequenced are prepared by incorporating 
KTP's, dNTP's and. as terminating nucleotides. ddNTP's which are substituted at the 5'- 
position of the sugar moiety with one or a combination of the isotopes J -C. 13 C. ]4 C. ] H. 
-H, J H. 16 0. 1 y O and I8 0. The polynucleotides obtained are degraded to 3'-nucleotides. 
cleaved at the N-glycosidic linkage and the isotopically labeled 5'-functionahty removed 
by periodate oxidation and the resulting formaldehyde species determined bv mass 
spectrometry. A specific combination of isotopes serves to discriminate base-specifically 
between internal nucleotides originating from the incorporation of NTP's and dNTP's and 
terminal nucleotides caused by linking ddNTP's to the end of the polynucleotide chain. A 
series of RN.A/DNA fragments is produced, and in one embodiment, separated by 
electrophoresis, and. with the aid of the so-called matrix method of analysis, the sequence 
is deduced. 

In Japanese Patent No. 59-131909. an instrument is described which detects 
15 nucleic acid fragments separated cither by electrophoresis, liquid chromatography or high 
speed gel filtration. Mass spectrometry detection is achieved by incorporating into the 
nucleic acids atoms which normally do not occur in DNA such as S, Br. I or Ae. Au. Pt. 
Os, Hg. The method, however, is not applied to sequencing of DNA using the Sanger 
method. In particular, it does not propose a base-specific correlation of such elements to 
20 an individual ddNTP. 

PCT Application No. WO 89/12694 (Brennan et ai. Proc. SPTF-Im Son 
Q P*- E "g- 1206 - fNew Techno] . Cvtony Mol Bio].), pp. 60-77 (1990); and Brennan, U.S. 
Patent No. 5,003.059) employs the Sanger methodology for DNA sequencing by usins a 
combination of either the four stable isotopes 32 S, 33 S, 34 S, 36 S or 35 C1, 37 C1, 79 Br 
25 8 1 Br t0 specifically label the chain-terminating ddNTP's. The sulfur isotopes can be 
located either in the base or at the alpha-position of the triphosphate moiety whereas the 
halogen isotopes are located either at the base or at the 3'-position of the sugar ring. The 
sequencing reaction mixtures are separated by an electrophoretic technique such as CZE. 
transferred to a combustion unit in which the sulfur isotopes of the incorporated ddNTP's 
are transformed at about 900°C in an oxygen atmosphere. The S0 2 generated with 
masses of 64. 65. 66 or 68 is determined on-line by mass spectrometry using, e.g., as mass 
analyzer, a quadrupole with a single ion-multiplier to detect the ion current. 

A similar approach is proposed in U.S. Patent No. 5.002.868 (Jacobson et 
a/., Prpc.SPIF-1pt Sor Opt 1435. ( Opt, Method. Ultrasensitive Detect Anal J ^h 

AcpL). 26-35 (1991)) using Sanger sequencing with four ddNTP's specifically substituted 



c.L,..:...p; u . , ns omy a: nersncc is :ntr use oi resonance ionization spectroscopy 
fPJS) in conjunction with a magnetic sector mass analyzer as disclosed m U.S. Patent No 
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4.442.554 io detect the sulfur isotopes corresponding to the specific nucleotide 
terminators, and by this, allowing the assignment of the DNA sequence. 

EPO Patent Applications No. 0360676 Al and 0360677 Al also describe 
Sanger sequencing using stable isotope substitutions in the ddNTP's such as D. 15 N. 
5 17 0 . 18 0 . 32 S< 33 S . 34 S . 36 S , 19 F . 35 Ch 37 CL 79 Bn 81 Br Md 127! or functional groups 
such as CF3 or SiCCI^ ^ at the base, the sugar or the alpha position of the triphosphate 
moiety according to chemical functionality. The Sanger sequencing reaction mixtures are 
separated by tube gel electrophoresis. The effluent is convened into an aerosol by the 
electrospray/thermospray nebulizer method and then atomized and ionized by a hot plasma 

10 (7000 to 8000°K) and analyzed by a simple mass analyzer. .An instrument is proposed 
which enables one to automate the analysis of the Sanger sequencing reaction mixture 
consisting of tube electrophoresis, a nebulizer and a mass analyzer. 

The application of mass spectrometry to perform DNA sequencing bv the 
hybridizatioa / fragment method (see above) has been recently suggested (Bains. "DNA 

15 Sequencing by Mass Spectrometry: Outline of a Potential Future Application." 
Chimicaog gT-9. 13-16(1991)). 

Summary of th e Invention 

The invention describes a new method to sequence DNA. The 

20 improvements over the existing DNA sequencing technologies include high speed, high 
throughput, no required electrophoresis (and. thus, no gel reading artifacts due to the 
complete absence of an electrophoretic step;, and no costly reagents involving various 
substitutions with stable isotopes. The invention utilizes the Sanger sequencins strategy 
and assembles the sequence information by analysis of the nested fragments obtained by 

25 base-specific chain termination via their different molecular masses using mass 

spectrometry, for example, MALDI or ES mass spectrometry. A further increase in 
throughput can be obtained by introducing mass modifications in the oligonucleotide 
primer, the chain-terminating nucleoside triphosphates and/or the chain-elongating 
nucleoside triphosphates, as well as using integrated tag sequences which allow 

30 multiplexing by hybridization of tag specific probes with mass differentiated molecular 
weights. 



Brief Descr iption of the FTG1 T RFS 

FIGURE 1 is a representation of a process to generate the samples to be 
analyzed by mass spectrometry. This process entails insertion of a DNA fraement of 

" ar/ ^ r - - jm:: ...;Kii. r hvj nav.-^pe silica;:;, termmatec nested fragment 

lamily temporarih to a solid support: removing by a washing step al! by-products. 
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conditioning the nested DNA or RNA fragments by. for example, canon-ion exchange or 
modification reagent and presenting the immobilized nested fragments either directly to 
mass spectrometry analysis or cleaving the purified fragment family off the suppon and 
evaporating the cleavage reagent. 
5 FIGURE 2A shows the Sanger sequencing products using ddTTP as 

terminating deoxynucleoside triphosphate of a hypothetical DNA fragment of 50 
nucleotides (SEQ ID NO:? ) in length with approximately equally balanced base 
composition. The molecular masses of the various chain terminated fragments are given. 

FIGURE 2B shows an idealized mass spectrum of such a DNA fragment 

10 mixture. 

FIGURES 3 A and 3B show, m analogy to FIGURES 2A and 2B. data for 
the same model sequence (SEQ ID NO:3) with ddATP as chain terminator. 

FIGURES 4A and 4B show data, analogous to FIGURES 2A and 2B when 
ddGTP is used as a chain terminator for the same model sequence (SEQ ID NO:3 ). 
15 FIGURES 5 A and 5B illustrate the results obtained where chain 

termination is performed with ddCTP as a chain terminator, in a similar way as shown in 
FIGURES 2A and 2B for the same model sequence (SEQ ID NO:3). 

FIGURE 6 summarizes the results of FIGURES 2A to 5B, showing the 
correlation of molecular weights of the nested four fragment families to the DNA 
20 sequence (SEQ ID NO:3). 

FIGURES 7A and 7B illustrate the general structure of mass-modified 
sequencing nucleic acid primers or tag sequencing probes for either Sanger DNA or 
Sanger RNA sequencing. 

FIGURES 8 A and 8B show the general structure for the mass-modified 
25 triphosphates for either Sanger DNA or Sanger RNA sequencing. General formulas of the 
chain-elongating and the chain-terminating nucleoside triphosphates are demonstrated. 

FIGURE 9 outlines various linking chemistries (X) with either 
polyethylene glycol or terminally monoalkylated polyethylene glycol (R) as an example. 

FIGURE 10 illustrates similar linking chemistries as shown in FIGURES 
30 8A and 8B and depicts various mass modifying moieties (R). 

FIGURE 1 1 outlines how multiplex mass spectrometnc sequencing can 
work using the mass-modified nucleic acid primer (UP). 

FIGURE 12 shows the process of multiplex mass spectrometric sequencing 
employing mass-modified chain-elongating and/or terminating nucleoside triphosphates. 
35 FIGURE 13 shows multiplex mass spectrometric sequencing by involving the 

hvbridization of mass-modified tae sequence specific r r ohe^ 



ju:. K:_ ; : ^nows a superposition or MALDI-TOF spectra of the 50-mer 
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d(TAACGGTCATTACGGCCATTGACTGTAGGACCTGCATTACATGACTAGCT) (SEQ 
ID NO:3 ) ( 500 fmol i and dlipdligg i 500 fmol ). 

FIGURES 10A-I6M show ihe MALDI-TOF spectra of all 13 DNA sequences 
representing the nested dT-ierminated fragments of the Sanger DNA sequencing simulation 
5 of Figure 2. 500 fmol each, as follows: 16A is a "-men 16B is a 10-mer: 16C is a 1 1-mer; 
1 6D is a 1 ^-mer; 1 6E is a 20-mer; 1 6F is a 24-mer; 1 6G is a 26-mer; 1 6H is a 33-mer; 1 61 is 
a 37-mer: 16.1 is a 38-mer: 1 6K is a 42-mer: 16L is a 46-mer and 16M is a 50-mer. 

FIGURES !"A and 1 7 B show the superposition of the spectra of FIGURE 16. 
The two panels show two different scales and the spectra analyzed at that scale. Figure 1 7 A 
1 0 shows the superposition of the spectra of 1 6A- 1 6F. The letter above each peak corresponds 
to the original spectra of the fragment in FIGURE 16. For example, peak B corresponds to 
FIGURE 16B: peak C corresponds to FIGURE 16C. etc. 

FIGURE 1 8 shows the superimposed MALDI-TOF spectra from MALDI-MS 
analysis of masb-rnodified oligonucleotides as described in Example 21. 
15 FIGURE 19 illustrates various linking chemistries between the solid support 

(P) and the nucleic acid primer fNA) through a strong electrostaiic interaction. 

FIGURES 20A and 20B illustrate various linking chemistries between the 
solid support TP) and the nucleic acid primer (NA) through a charge transfer complex of a 
charge transfer acceptor (A) and a charge transfer donor (D). 
20 FIGURE 21 illustrates various linking chemistries between the solid support 

(P) and the nucleic acid primer ("NA) through a stable organic radical. 

FIGURE 22 illustrates a possible linking chemistry between the solid support 
(P) and the nucleic acid primer (NA) through Watson-Crick base pairing. 

FIGURE 23 illustrates linking the solid support (P) and the nucleic acid 
25 primer fNA) through a photolytically cleavable bond. 



Detailed Description o f the Invention 

This invention describes an improved method of sequencing DNA. In 
particular, this invention employs mass spectrometry, such as matrix-assisted laser 
30 desorption'ionization (MALDI) or eiectrospray (ES) mass spectrometry (MS), to analyze the 
Sanger sequencing reaction mixtures. 

In Sanger sequencing, four families of chain-terminated fragments are 
obtained. The mass difference per nucleotide addition is 289.19 for dpC. 313.21 for dpA. 
329.21 for dpG and 304.2 for drT. respective! v 



: c ^;,sigr/jj >Lip^rp^s;:; jr. ie.L.. interpolation j o! the molecular weight peaks of the 
four individual experiments. In another embodiment, the molecular weights of the four 
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b> mixing the product, of all four reactions run in at least two separate reaction vessel, 
(i.e.. all run separate]) . or two together, or three together) or bv rurmm 2 one reaction 
havmg all four chain-terminating nucleotides (e.g.. a reaction mixture comprising dTTP 
ddTTP. dATP. ddATP. dCTP. ddCTP. dGTP. ddGTP) i„ one reaction vessel. Bv 
simultaneously analyzing all four base-specifically terminated reaction products.'the 
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molecular weight values have been, in effect, interpolated. Comparison of the mass 
difference measured between fragments with the known masses of each chain-temunatino 
nucleot.de allows the assignment of sequence to be carried out. In some instance* it mav 
be destrable to mass modify, as discussed below, the chain-terminatin 2 nucleotide, so as "to 
expand the difference in molecular weight between each nucleotide. It will be apparent to 
those skilled in the an when mass-modification of the cham-temunatine nucleotides is 
desirable and can depend, for instance, on the resolving ability of the particular 
spectrometer employed. By way of example, it may be desirable to produce four chain- 
terminating nucleotides. ddTTP. ddCTPl. dd.ATP? and ddGTp3 where ddCTPl ddAT^ 
and ddGTp3 have each been mass-modified so as to have molecular weights resolvable ' 
from one another by the particular spectrometer being used. 

The terms chain-elongating nucleotides and chain-terrrunatins nucleotides are 
well known in the an. For DNA. chain-elongating nucleotides include 
2'-deoxyribonucleotides and chain-terminating nucleotides include 
2\ S'-dideoxyribonucieotides. For RNA. chain-elongating nucleotides include 
nbonucelotides and chain-termmatmg nucleotides include 3'-deoxvribonucleotides The 
term nucleotide is also well known in the an. For the purposes of this invention 
nucleotides include nucleoside mono-, di-. and triphosphates. Nucleotides also include 
modified nucleotides such as phosphorothioate nucleotides. 

Since mass spectrometry is a serial method, in contrast to currentlv used 
slab gel electrophoresis which allows several samples to be processed in parallel 'in 
another embodiment of this invention, a further improvement can be achieved bv 
multiplex mass spectrometry DNA sequencing to allow simultaneous sequencing of more 
than one DNA or RNA fragment. As described in more detail below, the range of about 
300 mass units between one nucleotide addition can be utilized by emplovine either mass- 
modified nucleic acid sequencing primers or chain-elongating and/or terminate 
nucleoside triphosphates so as to shift the molecular weight of the base-specificaJlv 
terminated fragments of a panicular DNA or RNA species beine sequenced in a 
predetermined manner. For the first time, several sequencing reactions can be mass 
spectrometncally analyzed in parallel. In yet another embodiment of this invention 
multiplex mass spectrometric DNA sequencing can be performed bv mass modifving the 
fragment families through specific oligonucleotides (tag probes) which hvbridiz- to 
specific tag sequences within each of the fragment families. In another embodiment the 
tag probe can be covalently attached to the individual and specific tag sequence pnor to 
35 mass spectrometn'. 
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spectrometry The molecular weigh: values or preferably at leas: five and more preferabh 
at least ten base-specincally terminated fragments are determined by mass spectrometn * 
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Also included in the invention are determinations of the molecular we.eht values of at l^t 
-0 base-specifically terminated fragments and at least 30 base-specifically terminated 
fragments. Further, the nested base-specifically termmated foments m a specific s-t can 
be purified of all reactarts and by-products but are not separated from one another The 
> entire set of nested base-specifically terminated fragments is analyzed concurrently and the 
molecular weight values are determined. At least two base-specifically termmated 
fragments are analyzed concurrently by mass spectrometry when the fragments are 
contained in the same sample. 

In general, the overall mass spectrometry DNA sequencing process will start 
10 with a library of small genomic fragments obtained after first randomly or specifically 
cutting the genomic DNA into large p.eces which then, in several subcloning steps are 
reauced m size and inserted into vectors like derivatives of M13 or pUC (e g Ml3mpl8 
or M13mpl 9) (see FIGURE 1 ,. In a different approach, the fragments inserted in vectors 
such as M13. are obtained via subcloning starting with a cDNA library. In yet another 

5 approach, the DNA fragments to be sequenced are k„ ,u . 

j viiicidi>c cnain 

T^Z I"'" HlgUChi 61 aL " A GCneraI Meth ° d ° f " Vhr ° ^P^on and Mutagenesis 
ofDNA Fragments: Study of Protein and DNA Interactions." Nucleic AriH« p.. m 

7^1-67 (1988)). As is known in the art. Sanger sequencing can start from one nucleic 
aad primer (UP) binding to the plus-strand or from another nucleic acid primer binding to 
' the opposite minus-strand. Thus, either the complementary sequence of both strands of a 
given unknown DNA sequence can be obtained (providing for reduction of ambiguity in 
the sequence determination) or the length of the sequence information obtainable from one 
clone can be extended by generating sequence information from both ends of the unknot 
vector-inserted DNA fragment. 

The nucleic acid primer carries, preferentially at the 5-end, a linking 
functionality, L, which can include a spacer of sufficient length and which can interact 
with a suitable functionality, L\ on a solid support to form a reversible linkaee such as a 
photocleavable bond. Since each of the four Sanger sequencing families stam with a 
nucleic acid primer (L-UP; FIGURE I ) this fragment family can be bound to the solid 
support by reacting with functional groups. L\ on the surface of a solid support and then 
intensively washed to remove all buffer salts, triphosphates, enzymes, reaction by- 
products, etc. Furthermore, for mass spectrometry analysis, it can be of importance at thus 
stage to exchange the cation at the phosphate backbone of the DNA fragments in order to 
eliminate peak broadening due to a heterogeneity in the cations bound per nucleotide unit 
imce the L-L' linkage is only of a temporary nature with the purpose to capture the nested 
Sanger OK a n-p\; .« . , . 



wasnec. and cleaved off the support for mass spectrometry analysis, the temporary 
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linkage can be such thai is cieaved under the conditions of mass spectromerv i 
photocleavabie bond such as a charge transfer complex or a stable oreanic radical 
Funhermore. the linkage can be formed with L being a quatemar.- ammonium -roun 
<some examples are given in FIGURE 19). In this case, preferabiv. the surface if the -olid 
support carries negative charges which repel the negatively charced nucleic and backbone 
and thus facilitates desorption. Desorption will take place either bv the heat created bv the 
laser pulse and or. depending on L/ by specific absorption of laser enenrv which is in' 
resonance with the U chromophore (see. e.g.. examples civen in FIGURE 1 Q) The 
functionalities. L and L/ can also form a charge transfer complex and therebv form the 
temporary L-L' linkage. Various examples for appropriate functionalities with either 
acceptor or donator properties are depicted without limitation in FIGURES "'OA and -OB 
S.nce in many cases the -charge-transfer band" can be determined bv UV/vis spectrometry 
(see e.g. Ogamc Phr^ TrnnstV Comp ly by R. Foster. Academic Pre<s 1969whe 
laser energy can be tuned to the corresponding energy of the charge-transfer wavelength 
and. thus, a specific desorption off the solid support can be initiated. Those skilled ,n the 
art w,ll recogmze that several combinations can serve this purpose and that the donor 
functionality can be either on the solid support or coupled to the nested Sanaer DN.VRNA 
fragments or vice versa. 

In yet another approach, the temporary linkage L-L' can be generated by 
20 homolytically forming relatively stable radicals as exemplified in FIGURE 2 1 ' In example 4 
of FIGURE 21 . a combination of the approaches using charge-transfer complexes and stable 
organic radicals is shown. Here, the nested Sanger DN.VRNA fragments are captured via the 
formation of a charge transfer complex. Under the influence of the laser pulse, desorption (as 
_ discussed above, as well as ionization will take place at the radical position. In the other 
examples of FIGURE 21 under the influence of the laser pulse, the L-L' linkase will be 
cleaved and the nested Sanger DNA/RNA fragments desorbed and subsequent ionized at 
the radical position formed Those skilled ,n the art will recognize that other oreanic radical, 
can be selected and that. ,n relation to the dissociation energies needed to homoht.callv 
cleave the bond between them, a corresponding laser wavelength can be selected (see e B 
>0 A ctive M o| ft n,l-s by C. Wentrup. John Wiley & Sons. 1984). In yet another approach the 
nested Sanger DNA/RNA fragments are captured via Watson-Cnck base pamne to a solid 
support-bound oligonucleotide complementary- to either the sequence of the nucleic acid 
primer or the tag oligonucleotide sequence (see FIGURE 22). The duplex formed will be 
cleaved unde*- the ^, : : 



phosphotnester backbone, or em ? ioy,ng oligonucleotide mmencs such as PN'A analocs ( set 



s 



e.g. Nielsen c, a I . Science. 254. 1497,1991 „ which render the base 



sequence ]e?- 
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hence increases overall sta,:;,r> of the solid suppon-bound capture base secure W„ n 
ap P ropna,e bonds. L-L. a cleavage can be obtained direc.y v,th , i2scr mned „ 
energy necessary for bond cieavag , Thus. the .mobilized „=s,ed Sanger femem, c- 
be directly ablated during mass spectrometnc analysis " " ~' 

5 To mcrease mass spectrornetnc perforrnar.ee. it m ay be necessan- ,„ modtfl 

*e phosphoaies.er backbone poor ,„ MS analysis. Th,s can be accoTiphsned'bv for ' 

W nh alkylating agents such as akyl.odides, .odoacetamide. (J-iodoethanol. 2 3-epoxv. , 
propano, (see FIGURE 1 0 , the monothio phosphodiester bonds of *e nested SaLer 
ragmen* are tr^sformed ,n,o phosphotriester bonds. Mu„,p,exmg by mass modification 

It T ' S , ■ maSS ' m ° di ^ "* «* P™er (UP , or the nucleoside 

.nphosphates a, the sugar or the base moiety . To those stalled ,„ tne an other 
modifications of the nested Sanger fragments car, be envisioned. ,„ one embod.men, of 

A. invention, the l.nkrng chemistry aUows one to cleave off the so-purified nested DNA 
' enzvmaucallv rrhmi^ih „ - 0 UNA 

a rvr,e „f h :""7" H "' ' ay Way M exan, P' e - ,he L -L' chemisrrv can be of 

2 T ' em,Ca " y C,eaVaWe ' fM b >' — ptoethartoi or 

= oZ« 3 ° i0tia;S,repHVidi " » h-robtfimconal denvative of a tnrv, eU,er 

group poster „ al, "A Versa,i,e Acid-Labile Linker for Modification of Svnthenc' 

B. -o,ecu,es/.I aQtstol ^ il , 7095 (l990)) which can be cIe aved under mi,dlv 
ac,d, c condtuons. . ,_ lin; , group deavaWc ^ amoa m.,dh 

hyd„aceta t = buffer, an arginine-argtnine or ,ysi„e-,ys,ne bond cleavable bv an 
endopepttdase enzyme like trypsin or a pyrophosphate bond cleavable bv a 
pyrophosphau., a pnotocleavable bond which can be, for example, phy'sical.v cleaved 
and the like (see. e.g., FIGURE 23). Optional another cation exchange can be 
performed pnor to mass spectrometry analysis. In the instance that an enzvme-cleavable 
bon ts ut.ltzed to immobile the nested fragments, the enzyme used ,o cleave tn e o n d 
can serve as an internal mass standard during MS analysis. 

„ um K f J hei " lrif ' Ca,i0npr0cessand ' OTi ™ change process can be carried ou, bv a 
number o other methods instead of, or in conjunction with, tmmobilization on a s d 
npport. or example, the base-specficaUy terminated products can be sepald ft 1 th- 
rcactants by calysis. fixation (tncluding ultrafinrat.on,, md chromatography Ukelt 
*ese techmques can be used to exchange the cation of the phosphate backbone w „T 
counter-ton which reduces peak broadening. 

standard , ^ baSe ' SpeC ' llCa ">' Iem '" aled families can be generated bv 
standard Sanger sequencing using the Large Klenow fragment off <-„., nK . ^ 



DN A Z; " Kiav '- '**-**«»°»x> wmmrnni RSa transcnois of the 

•i .n.-. also be utilizes for mass spectrometric seouence 
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determination. In this case, various RNA polymerases such as the SP6 or the T7 RNA 
polymerase can be used on appropriate vectors containing, for example, the SP6 or the T~ 
promoters (e.g. Axelrod ei a!. 'Transcription from Bacteriophage T7 and SP6 RNA 
Polymerase Promoters in the Presence of 3'-Deoxyribonucleoside S'-triphosphate Cham 
5 Terminators." Biochemistry 24. 5716-23 (1985)). In this case, the unknown DNA 

sequence fragments are inserted downstream from such promoters. Transcription can also 
be initiated by a nucleic acid primer (Pitulle et al, "Initiator Oligonucleotides for the 
Combination of Chemical and Enzymatic RNA Synthesis/' Gene 1 12 . 101-105 (1992)) 
which carries, as one embodiment of this invention, appropriate linking functionalities. L. 
10 which allow the immobilization of the nested RNA fragments, as outlined above, prior to 
mass spectrometry analysis for purification and/or appropriate modification and or 
conditioning. 

For this immobilization process of the DN A/RNA sequencing products for 
mass spectrometry analysis, various solid supports can be used, e.e.. beads (silica cel. 
15 controlled pore glass, magnetic beads. Sephadex/Sepharose beads, cellulose beads, etc.), 
capillaries, glass fiber filters, glass surfaces, metal surfaces or plastic material. Examples 
of useful plastic materials include membranes in filter or microliter plate formats, the latter 
allowing the automation of the purification process by employing microliter plates which, 
as one embodiment of the invention, carry a permeable membrane in the bottom of the 

20 well functionalized with L\ Membranes can be based on polyethylene, polypropylene, 
polyamide, polyvinylidenedifluoride and the like. Examples of suitable metal surfaces 
include steel, gold, silver, aluminum, and copper. After purification, cation exchange, 
and/or modification of the phosphodiester backbone of the L-L' bound nested Saneer 
fragments, they can be cleaved off the solid support chemically, enzymatically or 

25 physically. Also, the L-L' bound fragments can be cleaved from the support when they are 
subjected to mass spectrometry analysis by using appropriately chosen L-L' linkages and 
corresponding laser energies/intensities as described above and in FIGURES 19-23. 

The highly purified, four base-specifically terminated DNA or RNA 
fragment families are then analyzed with regard to their fragment lengths via 

30 determination of their respective molecular weights by MALDI or ES mass spectrometrv. 

For ES. the samples, dissolved in water or in a volatile buffer, are injected 
either continuously or discontinuously into an atmospheric pressure ionization interface 
(.API) and then mass analyzed by a quadrupole. With the aid of a computer program, the 
molecular weight peaks are searched for the known molecular weight of the nucleic acid 

35 primer CUP) and determined wb<"~- -♦"♦^ - . ... . 

■ •' ■ ■ - - ■■ '■■ — ' --^ rMUu.; -u.:: ' >. :>::t:; 4 ^_ .i^nnc: 

mis. the nucleotide sequence :s assigned. The generation of multiple ion peaks which can 
be obtained using ES mass spectrometry can increase the accuracv of the mas^ 
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determinauon. 

In MALDI mas 5 spectrometry', various mass analyzers can be used. e.g.. 
magnetic sector magnetic deflection instruments in single or triple quadrupoie mode 
(MS/MS). Fourier transform and ume-of-flight (TOF) configurations as is known in the 
5 an of mass spectrometry. FIGURES 2A through 6 are given as an example of the data 
obtainable when sequencing a hypothetical DNA fragment of 50 nucleotides in length 
(SEQ ID NO:3 ) and having a molecular weight of 15.344.02 daltons. The molecular 
weights calculated for the ddT (FIGURES 2A and 2B). ddA (FIGURES 3 A and 3B). ddG 
(FIGURES 4A and 4B) and ddC (FIGURES 5A and 5B) terminated products are given 

10 (corresponding to fragments of SEQ ID NO:3) and the idealized four MALDI-TOF mass 
spectra shown. All four spectra are superimposed, and from this, the DNA sequence can 
be generated. This is shown in the summarizing FIGURE 6. demonstrating how the 
molecular weights are correlated with the DNA sequence. MALDI-TOF spectra have 
been generated for the ddT terminated products (FIGURES 16A-16M) corresponding to 

15 those shown in FIGUF.E 2 and these spectra have been superimposed (FIGURES 17A and 
1 7B). The correlation of calculated molecular weights of the ddT fragments and their 
experimentally-verified w eights are shown in Table 1. Likewise, if all four chain- 
terminating reactions are combined and then analyzed by mass spectrometry, the 
molecular weight difference between two adjacent peaks can be used to determine the 

20 sequence. For the desorptioa'ionization process, numerous matrix/laser combinations can 
be used. 

TABLE I 

-5 Correlation of calculated and experimentally verified molecular weights of the 13 DNA 

fragments of FIGURES 2 and 1 6 A- 1 6M. 



Fragment (n-mer) 


calculated mass 


experimental mass 


difference 


7-mer 


2104.45 


2119.9 


-15.4 


1 0-mer 


301 1.04 


3026.1 


-15.1 


1 1 -me: 


3315.24 


3330.1 


-14.9 


19-mer 


5771.82 


5788.0 


-16.2 


20-mer 


6076.02 


6093.8 


-17.8 


24-mer 


731 1.82 


7374.9 


-63.1 


26-mer 


7945.22 


7960.9 


-15.7 


33-mer 


101 12.63 


10125.3 


-12.7 


37-mer 


1 1348.43 


11361.4 


-13.0 


38-mer 


1 1652.63 


i l -> 
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In oraer 10 increase throughput to a Jex el necessary for hich volume oenomic 
and cDN A sequencing rro;ect,. a funher embodiment of the present invention is to utili- 
multipiex mass spectrometry to simultaneous!) determ.ne more than one sequ-n- Thi< 
can be achieved by several, albeit different, methodologies, the basic principle beino :h e" 
mass moditication of the nucleic acid primer (UP), the chain-elonsatino and or 
terminating nucleoside triphosphates, or by using mass-differentiated tae probe, 
hybndizable to specific tag sequences. The term "nucleic acid primer" a~s used he~in 
encompasses primers for both DNA and RNA Sanger sequencing. 

By way of example. FIGURE 7 A presents a general formula of the nucleic 
acd pnmer (UP) and the tag probes (TP). The mass modifying moietv can be attach-d 
for instance, to either the 5-end of the oligonucleotide ( M l to the nucleobase ,or bases; 
(M-. M ). to the phosphate backbone (M?). and to the 2'- P osit>on of the nucleoside 
■ nucleosides,,!^. M*. or and t0 lhe lermmaI 3,^^ (m5) ^ ^ ^ ^ 

uetw.cn 1 and 50 nucleotides in length. For the priming of DN'A Saucer sequencing the 
pnmer is preferentially in the range of about 15 to 30 nucleotides m leneth For 
artificially priming the transcription in a RNA polymerase-mediated Sanser seauencin* 
reaction, the length of the primer is preferentially in the range of about 2 to 6 nucleotides 
If a tag probe (TP) is to hybridize to the integrated tag sequence of a familv chain- 
terminated fragments, its preferential length is about 20 nucleotides. 

The table in FIGURE 7B depicts some examples of mass-modified 
primemag probe configurations for DNA. as well as RNA. Saneer sequent Thas list 
is. however, not meant to be limiting, since numerous other combinations of mass- 
mod.fying funct.ons and positions within the oligonucleotide molecule are possible and 
are deemed pan of the invention. The mass-modifying functionality can be. for example 
a halogen, an az.do. or of the type. XR. wherein X is a l.nkine eroup and R is a mass- 
modifying functionality. The mass-modifying functionality can thus be used to introduce 
defined mass increments into the oligonucleotide molecule. 

In another embodiment, the nucleotides used for chain-eloncation and/or 
termination are mass-modified. Examples of such modified nucleotides are shown in 
FIGURE SA and SB. Here the mass-modifying moiety. M. can be attached either to the 
nucleobase. M- (in case of the c7-deazanucleosides also to C-7. M\ to the tnphosphate 
group at the alpha phosphate. M-\ or to the 2'-posit,on of the sugar nn 2 of the nucleoside 
triphosphate. M 4 and M 6. Funhermore. the mass-modifying functionality can be added so 
as u- affect chain termination ^.-ch 3^ k v -,r--, ;. . v ,„ - „ . . ; , ; _ ... 



SeqUenCin -- For SK:,iec ln the however, it is clear mat mam oth- 
:omo,nat,ons can serve the purpose of the invention equally well. In the same way. those 
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mass-modified in a similar fashion with numerous variations and combinations m 
functionality and attachment positions. 

Without limiting the scope of the invention. FIGURE 9 gives a more detailed 
description of particular examples of how the mass-modification, M, can be introduced for 
5 X in XR as well as using oligo- polyethylene glycol derivatives for R. The mass- 
modifying increment in this case is 44. i.e. five different mass-modified species can be 
generated by just changing m from 0 to 4 thus adding mass units of 45 (m=0). 89 (m=l ). 
133 (m=2). 177 (m=3) and 221 (m=4) to the nucleic acid primer (UP), the tag probe (TP) 
or the nucleoside triphosphates respectively. The oligo/polyethylene glycols can also be 
10 monoalkylated by a lower alkyl such as methyl. ethyl, propyl, isopropyl. t-butyl and the 
like. A selection of linking functionalities. X. are also illustrated. Other chemistries can 
be used in the mass-modified compounds, as for example, those described recently in 
Oligonucleotides and .Analogues. A Practical Approach . F. Eckstein, editor, IRL Press, 
Oxford. 1991. 

15 In yet another embodiment, various mass-modifying functionalities. R, other 

than oligo/polyethylene glycols, can be selected and attached via appropriate linking 
chemistries, X. Without any limitation, some examples are given in FIGURE 10. A 
simple mass-modification can be achieved by substituting H for halogens like F, CI, Br 
and/or L or pseudohalogens such as SCN, NCS, or by using different alkyl, aryl or aralkyl 

20 moieties such as methyl, ethyl, propyl, isopropyl, t-butyl. hexyl, phenyl, substituted 
phenyl, benzyl, or functional groups such as CH2F, CHF2, CF3, Si(CH3)3, 
Si(CH3)2(C2H5), Si(CH3)(C2H5)2. Si(C2H5)3 . Yet another mass-modification can be 
obtained by attaching homo- or heteropeptides through X to the UP, TP or nucleoside 
triphosphates. One example useful in generating mass-modified species with a mass 

25 increment of 57 is the attachment of oligoglycines. e.g., mass-modifications of 74 (r=l, 

m=0), 131 (r=l, m=2), 188 (r=l, m=3), 245 (r=l. m=4) are achieved. Simple oligoamides 
also can be used, e.g., mass-modifications of 74 (p=l, m=0), 88 (r=2, m=0), 102 (r=3, 
m=0), 1 16 (r=4, m=0). etc. are obtainable. For those skilled in the art, it will be obvious 
that there are numerous possibilities in addition to those given in FIGURE 1 0 and the 

30 above mentioned reference ( Oligonucleotides and .Analogues . F. Eckstein, 1991). for 

introducing, in a predetermined manner, many different mass-modifying functionalities to 
UP, TP and nucleoside triphosphates which are acceptable for DNA and RNA Sanger 
sequencing. 

As used herein, the superscript 0-i designates i + 1 mass differentiated 
35 nucleotides, primers or tags. In some instances, the superscript 0 (e.g.. NTpO, UPO) can 

desi en at f ? ^ ' ( ^ r * n ^ A * ~ ~ ~ ' - ■ ■ ■ • > 

'^.Di'v. rr-prt- man d:vj .s?fr!^ ■•: nucie;^ . „ . t . JN A dories 1 are tu oe concurrent!) 

sequenced by multiplex DNA sequencing, then 1 - i different mass-modified nueieic acid 
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primers « fpO. UPl-...UP' , can oe used to distinguish each set of base-specificain 
terminated fragments, uherein eacn spec.es of mass-modnled UPi can be distinguished bv 
mass spectrometry from the res:. 

As illustrative embodiments of this invention, three different bas 1C processes 
> for multiplex mass spectrometry DNA sequencing employing the described mass- 
modified reagents are described below: 

A ) Multiplexing by the use of mass-modified nucleic acid 
primers (UP) for Sanger DNA or R_\'A sequencing (see for example 
FIGURE 1 1 ): 

B) Multiplexing by the use of mass-modified nucleoside 
triphosphates as chain elongators and/or chain terminators for Sanger 
DNA or RNA sequencing (see for example FIGURE 12); and 

C) Multiplexing by the use of tag probes which specifically 
hybridize to tag sequences which are integrated into pan of the four 
Sanger DNA/RNA base-specifically terminated fragment families. 
Mass modification here can be achieved as described for FIGURES 7a. 
7B. 9 and 1 0. or alternately, by designing different oligonucleotide 
sequences having the same or different length with unmodified 
nucleotides which, in a predetermined way, generate appropriately 
differentiated molecular weights (see for example FIGURE 13). ' 

The process of multiplexing by mass-modified nucleic acid primers (UP) is 
illustrated by way of example in FIGURE 1 1 for mass analyzin* four different DNA 
clones simultaneously. The first reaction mixture is obtained by standard Saneer DNA 
sequencing having unknown DNA fragment 1 (clone 1) integrated m an appropriate vector 
(e.g.. Ml 3mpl 8). employing an unmodified nucleic acid primer UPO. and a standard 
mixture of the four unmodified deoxynucleoside triphosphates. dNTpO, and with l/10th 
of one of the four dideoxynucleoside triphosphates. ddNTpO. A second reaction mixture 
for DNA fragment 2 (clone 2 , is obtained by employing a mass-modif.ed nucleic acid 
primer UP' and. as before, the four unmodified nucleoside triphosphates. dNTpO 
containing in each separate Sanger reaction l/]0th of the chain-terminating unmodified 
dideoxynucleoside triphosphates ddNTpO. ]n the other two experiments, the four Sanger 
reactions have the following compositions: DNA fraement 3 (clone 3) UP? dNTpO " 
ddNTpO and DNA fragment 4 (clone 4). Tjp3. dNTP 0. ddNTP 0. For mass spectrometnc 

DNA sequencing, all base-specifically terminated reacnons of the four clones are pooled 

and mas^ anahved TV 



m °' eCUiar pe3KS ot lP ■ 1 pi - and UP-* extended bs either one of the four 
aiaeoxynuc.eos.de triphosphates. UP0-dd\0. UPL ddN 0. uP--dd\0 and UP-"-dd\0 }• 
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this way. the first nucleotides of the four unknown DNA sequences of clone 1 to 4 are 
determined. The process is repeated, having memorized the molecular masses of the four 
specific first extension products, until the four sequences are assigned. Unambiguous 
mass/sequence assignments are possible even in the worst casr scenario in which the four 
mass-modified nucleic acid primers are extended by the same dideoxynucleoside 
triphosphate, the extension products then being, for example. UP°-ddT, UP J -ddT. UP-- 
ddT and UP3-ddT. which differ by the known mass increment differentiating the four 
nucleic acid primers. In another embodiment of this invention, an analogous technique is 
employed using different vectors containing, for example, the SP6 and/or T7 promoter 
sequences, and performing transcription with the nucleic acid primers UP 0 . UP I. UP- and 
UP3 and either an RNA polymerase (e.g.. SP6 or T7 RNA polymerase) with chain- 
elongating and terminating unmodified nucleoside triphosphates NTP° and 3'-dNTP°- 
Here, the DNA sequence is being determined by Sanger RNA sequencing. 

FIGURE 12 illustrates the process of multiplexing by mass-modified chain- 
eiongatmg or/and terminating nucleoside triphosphates in which three different DNA 
fragments (3 clones) are mass analyzed simultaneously. The first DNA Sancer sequencing 
reaction (DNA fragment 1. clone 1) is the standard mixture employing unmodified nucleic' 
acid primer UpO, dNT p0 and in each of the four reactions one of the four ddNTpO. The 
second (DNA fragment 2, clone 2) and the third (DNA fragment 3. clone 3) have the 
following contents: UP 0 , dNTP° ddNTPl and UP 0 , dNTPO, ddNTP? , respectively. In a 
variation of this process, an amplification of the mass increment in mass-modifying the 
extended DNA fragments can be achieved by either using an equally mass-modified 
deoxynucleoside triphosphate (i.e.. dNTPl, dNTP?) for chain elongation alone or in 
conjunction with the homologous equally mass-modified dideoxynucleoside triphosphate. 
For the three clones depicted above, the contents of the reaction mixtures can be as 
follows: either UPO/dNTpO/ddNTpO, UPO/dNTPl/ddNTpO and U?0/ dNTP 2 /ddNT pO or 
UPO/dNTpO/ddNTpO, UpO/dNTPl/ddNTPl and UpO/dNTp2/ddNTP2. As described 
above, DNA sequencing can be performed by Sanger RNA sequencing employing 
unmodified nucleic acid pnmers. UP 0 , and an appropriate mixture of chain-elongltins and 
terminating nucleoside triphosphates. The mass-modification can be again either in the 
chain-terminating nucleoside triphosphate alone or in conjunction with mass-modified 
chain-elongating nucleoside triphosphates. Multiplexing is achieved by pooling the three 
base-specifically terminated sequencing reactions (e.g., the ddTTP terminated products) 
and simultaneously analyzing the pooled products by mass spectrometry. Again, the first 
extension products of the known nucleic acid prime- sentience r^e or,.--.,- „ .. . .-, , 



all tnree clones, i ne ioliowmc configurations Uius obtained can be well differentiated by 
their different mass-modifications: UP°-ddT°. UpO-ddT 1 . UP°-ddT- 
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In > et another embodiment of this invention. DNA sequencing by multiplex 
mass spectrometry can be achieved b> cioning the DNA fragments to be sequenced in 
"plex-veciors" containing vector specific "tag sequences" as described (Koster c: a!.. 
"Oligonucleotide Synthesis and Multiplex DNA Sequencing Using Chemiluminesceni 
5 Detection/' Nucleic Acids Res . Symposium Ser. No. 24. 3 18-321 ( 1 99 1 )): then pooling 
clones from different plex-vectors for DNA preparation and the four separate Sanger 
sequencing reactions using standard dNTP^/ddXTP^ and nucleic acid pnmer LT^; 
purifying the four multiplex fragment families via linking to a solid support through the 
linking group. L. at the 5'-end of UP: washing out all by-products, and cleaving the 

10 purified multiplex DNA fragments off the support or using the L-L' bound nested Sanger 
fragments as such for mass spectrometric analysis as described above: performing de- 
multiplexing by one-by-one hybridization of specific "tag probes": and subsequentlv 
analyzing by mass spectrometry (see. for example. FIGURE 13). As a reference point, the 
tour base-specifically terminated multiplex DNA fragment families are run by the mass 

1 :> spectrometer and ai 1 ddT 0 -. ddA 0 -. ddC°- and ddG° -terminated molecular ion peaks are 
respectively detected and memorized. Assignment of. for example. ddT^-terminated DNA 
fragments to a specific fragment family is accomplished by another mass spectrometric 
analysis after hybridization of the specific tag probe (TP) to the corresponding tag 
sequence contained in the sequence of this specific fragment family. Only those 

20 molecular ion peaks which are capable of hybridizing to the specific tag probe are shifted 
to a higher molecular mass by the same known mass increment (e.g. of the tag probe). 
These shifted ion peaks, by virtue of all hybridizing to a specific tag probe, belong to the 
same fragment family. For a given fragment family, this is repeated for the remaining 
chain terminated fragment families with the same tag probe to assign the complete DNA 

25 sequence. This process is repeated i-1 times corresponding to i clones multiplexed (the 
i-th clone is identified by default). 

The differentiation of the tag probes for the different multiplexed clones can 
be obtained just by the DNA sequence and its ability to Watson-Crick base pair to the tag 
sequence. It is well known in the an how to calculate stringency conditions to provide for 

30 specific hybridization of a given tag probe with a given tag sequence (see, for example. 
Molecular Cloning: A laboratory manual 2ed. ed. by Sambrook. Fritsch and Maniatis 
(Cold Spring Harbor Laboratory Press: NY, 1989, Chapter 1 1). Furthermore, 
differentiation can be obtained by designing the tag sequence for each plex-vector to have 
a sufficient mass difference so a- To be unioue h\- cban^inr the lenrrh o- 

spectrometric analysis. is another embodiment or the invention to provide for a covaien: 
attachment mediated b_\ . for example, photoreactive groups such as psoralen and 
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Helene ,/ Xaiure ±u 55S , 1 QQo, and Thuong er al "Oligonucleotides Attached to 
Intercalates. Photoreacm e and Cleavage Agents" in F. Eckstein. Olioonuc.^,H^ ^ 
Analogues: A Practical \? V ^cY . IRi p reS s. Oxford 1 991 . 285-306). 

The DNA sequence is unraveled again by searching for the lowest molecular 
J weight molecular ,on peak corresponding to the known UpO-ta B sequence-tag probe 
molecular weight plus the first extension product, e.g.. ddTO, then the second the third 
etc. 

In a combination of the latter approach with the previously described 
multiplexing processes, a further increase in multiplexing can be achieved bv usin* in 
addition to the tag probe tag sequence interaction, mass-modified nucleic acid primers 
(FIGL'RES 7A and 7B) and or mass-modified deoxynucleoside. dNTpO-i- and/or 
dideoxynucleoside triphosphates. ddNTpO-i. Those skilled in the an will realize that the 
tag sequence tag probe multiplexing approach is not limited to San 2 er DNA seouencino 
generating nested DNA fragments with DN.A polymerases. The DNA sequence can als^ 
uc determined by transcribing the unknot DNA sequence from appropriate promoter- 
containing vectors (see above, with various RNA polymerases and mixtures of NTpO-i*'- 
dNTPU-i. thus generating nested RNA fragments. 

In yet another embodiment of this invention, the mass-modifVin* 
funct,onality can be introduced by a two or multiple step process. In this case! the nucleic 
acd primer, the chain-elongating or terminating nucleoside triphosphates and/or the tas 
probes are. ,n a first step, modified by a precursor functionalitv such as azido -N-> or ~ 
modified with a functional group in which the R in XR is H (FIGURES 7A, 7B 9")' thus 
providing temporary functions, e.g.. but not limited to -OH. -NH-? -NHR -SH NCS 
-OCO(CH 2 ) r COOH (r = 1-20,. -NHCO(CH 2 ) r COOH (r = 1-20),"-OSO->OH 
-OCO(CH 2M (r « 1 -20 , -OP(0-Alkyl)N(Alkvl) 2 . These less bulky functionalities result 
•n better substrate propen.es for the enzymatic DNA or RNA synthesis reactions of the 
DNA sequencing process. The appropriate mass-modifVine functionalitv is then 
introduced after the generation of the nested base-specifically terminated DNA or RNA 
fragments pr.or to mass spectrometry. Several examples of compounds which can serve 
as mass-modifymg functionalities are depicted m FIGURES 9 and 10 without limitine the 
scope of this invention. 

Another aspect of this invention concerns kits for sequencine nucleic acids 
by mass spectrometry which include combinations of the above-described sequencine 
reactants. For instance, in one embodiment, the kit comprises reactants for multiplexes 
spectrometnc sequencine of several different specie* nfn„ r |~- - 
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dCTP and dTTP. or ATP. CTP. OTP and LTP): a set of chain-terminatine nucleotides 
(such as 2'.3'-dideox> nucleotides for DN'A synthesis or 3'-deoxynucleotides for RNA 
synthesis): and an appropriate polymerase for synthesizing complementary nucleotides. 
Primers and/or terminating nucleotides can be mass-modified so that the base-specificailv 
5 terminated fragments generated from one of the species of nucleic acids to be seouenced' 
can be distinguished by mass spectrometry from all of the others. Alternative to the use of 
mass-modified synthesis reactants. a set of tag probes ( as described above) can be 
included in the kit. The kit can also include appropriate buffers as well as instructions for 
performing multiplex mass spectrometry to concurrently sequence multiple species of 
10 nucleic acids. 

In another embodiment, a nucleic acid sequencing kit can compose a solid 
support as described above, a primer for initiating synthesis of complementary nucleic 
acid fragments, a set of chain-elongating nucleotides and an appropriate polymerase The 
mass-modified chain-terminating nucleotides are selected so that the addition of one of the 
15 cham terminators to a growing complementary nucleic acid can be distinsuished bv mass 
spectrometry. 
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Immobilization of primer-extension products of Sanger DNA sequencing reaction for 
mass spectrometric analysis via disulfide bonds. 

As a solid support. Sequelon membranes (Millipore Corp., Bedford MA) 
with phenyl isothiocyanate groups are used as a starting material. The membrane' disks 
with a d,ameter of 8 mm. are wetted with a solution of N-methvlmorpholine/waterO- 
propanol (NMM solution) (2/49/49 v/v/v). the excess liquid removed with filter paper and 
placed on a p,ece of plastic film or aluminum foil located on a heating block set to 55<>C 
A solution of 1 mM 2-mercaptoethylamine (cvsteamine) or 2. 2'-dithio-bis(ethvlamine) 
(cystamme) or S-(2-thiopyridyl)-2-thio-ethylamine (10 ul, 10 nmol) in NMM is added per 
disk and heated at 55°C. After 15 min. 10 ul of NMM solution are added per disk and 
heated for another 5 mm. Excess of isothiocyanate groups mav be removed bv treatment 
with 10 ul of a 10 mM solution of glycine m NMM solution. For cvstamine. the disks are 
treated with 10 ul of a solution of 1M aqueous dithiothreitol (DTT)/2-pro P anol (1 ■] v/v) 
for 1 D mm at room temperature. Then, the disks are thoroughly washed m a filtration 
manifold with 5 aliquots of 1 ml each of the NMM solution, then with 5 aliquots of 1 ml 



solut.cn. The primer oligonucleotides with 5 -SH functionality can be prepared bv'vanou- 
metnods (e.g.. B.C.F Chu et ai. Nucleic AHnc p„ m. 5591-560? H9R6). Srroa-' e- - 
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Nucleic Acids Res. 1>. 4S3"-48 ( 1987) and Oligonucleotides and Arrsloay ^ s; A Practical 
Approach (F. Eckstein, editor). 1RL Press Oxford. 1 99 1 ). Sequencing reactions according 
to the Sanger protocol are performed in a standard way (e.g.. H. Swerdlow er ai. 
Nucleic Acids Re< 18. 1415-19 (1990)). In the presence of about 7-10 mM DTT the free 
5 5-thiol primer can be used: in other cases, the SH functionality can be protected, e.g., bv a 
trityl group during the Sanger sequencing reactions and removed prior to anchoring to the 
support in the following way. The four sequencing reactions (150 ul each in an Eppendorf 
tube) are terminated by a 10 min incubation at 70°C to denature the DNA polvmerase 
(such as Klenow fragment. Sequenase) and the reaction mixtures are ethanol precipitated. 
10 The supematants are removed and the pellets vonexed with 25 ul of an IM aqueous silver 
nitrate solution, and after one hour at room temperature, 50 ul of an 1 M aqueous solution 
of DTT is added and mixed by vortexing. After 15 min, the mixtures are centrifuged and 
the pellets are washed twice with 100 ul ethylacetate by vortexing and centrifucation to 
remove excess DTT. The pnmer extension products with free 5-thiol group are now 
15 coupled to the thiolated membrane supports under mild oxidizing conditions. In general, 
it is sufficient to add the 5-thiolated primer extension products dissolved in 10 ul 10 mM 
de-aerated triethylammonium acetate buffer (TEAA) pH 7.2 to the thiolated membrane 
supports. Coupling is achieved by drying the samples onto the membrane disks with a 
cold fan. This process can be repeated by wetting the membrane with 10 ul of 10 mM 
20 TEAA buffer pH 7.2 and drying as before. When using the 2-thiopyridyl derivatized 
compounds, anchoring can be monitored by the release of pyridine-2-thione 
spectrophotometrically at 343 nm. 

In another variation of this approach, the oligonucleotide primer is 
functionalized with an amino group at the 5'-end which is introduced by standard 
25 procedures during automated DNA synthesis. .After primer extension, during the Sanger 
sequencing process, the primary amino group is reacted with 3-(2-pyridyldithio) propionic 
acid N-hydroxysuccinimide ester (SPDP) and subsequently coupled to the thiolated 
supports and monitored by the release of pyridyl-2-thione as described above. After 
denaturation of DNA polymerase and ethanol precipitation of the sequencing products, the 
30 supematants are removed and the pellets dissolved in 1 0 ul 1 0 mM TEAA buffer pH 7.2 
and 10 ul of a 2 mM solution of SPDP in 10 mM TEAA are added. The reaction mixture 
is vonexed and incubated for 30 min at 25°C. Excess SPDP is then removed by three 
extractions (vortexing, centrifugation) with 50 ul each of ethanol and the resulting pellets 
are dissolved in 10 ul 10 mM TEAA buffer pH 7.2 and coupled to the thiolated supports 
35 (see above). 

Duller pH . ihe purified pnmer-extension products are released bv three 
successive treatments with 10 ui of 10 mM 2-mercaptoethanoi in 10 mM TEAA buffer pH 
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7.2. lyophilized and analyzed by either ES or MALDI mass spectrometry. 

This procedure can also be used for the mass-modified nucleic acid pnmers 
UP 0 " 1 in an analogous and appropriate way, taking into account the chemical properties of 
the mass-modifying functionalities. 



EXAMPLE 



Immobilization of primer-extension products of Sanger DNA sequencing reaction for 
mass spectrometric analysis via the levulinyl group 

5-Aminole\iilinic acid is protected at the primary amino group with the 
Fmoc group using 9-fluorenylmethyl N-succinimidyl carbonate and is then transformed 
into the N-hydroxysuccinimide ester (NHS ester) using N-hydroxysuccinimide and 
dicyclohexyl carbodiimide under standard conditions. For the Sanger sequencing 

reactions, nucleir ariH nrimc«rc T TpO-i Qr» n»H <i^^k ^» a, — i . 

— r — • «iuni v. iLun.nunaiiz.cu wiui a pnmarv 

15 amino group at the 5'-end introduced by standard procedures during automated DNA 
synthesis with aminoiinker phosphoamidites as the final synthetic step. Sanser 
sequencing is performed under standard conditions (see above). The four reaction 
mixtures (150 ul each in an Eppendorf tube) are heated to 70°C for 10 min to inactivate 
the DNA polymerase, ethanol precipitated, centrifuged and resuspended in 10 ul of 10 mM 
TEAA buffer pH 7.2. 1 0 ul of a 2 mM solution of the Fmoc-5-arninolevuIinyl-NHS ester 
in 10 mM TEAA buffer is added, vortexed and incubated at 25°C for 30 min. The excess 
of the reagent is removed by ethanol precipitation and centrifugation. The Fmoc group is 
cleaved off by resuspending the pellets in 10 ul of a solution of 20% piperidine in N.N- 
dimethylformamide/water (1:1 v/v). After 15 min at 25°C, piperidine is thoroughly 
removed by three precipitations/centrifugations with 100 ul each of ethanol. the pellets are 
resuspended in 10 ul of a solution of N-methylmorpholine, 2-propanol and water (2/10/88 
v/v/v) and are coupled to the solid support carrying an isothiocyanate group. In the case of 
the DITC-Sequelon membrane (Millipore Corp., Bedford. MA), the membranes are 
prepared as described in EXAMPLE 1 and coupling is achieved on a heating block at 
55°C as described above. RNA extension products are immobilized in an analogous way. 
The procedure can be applied to other solid supports with isothiocyanate groups in a 
similar manner. 

The immobilized primer-extension products are extensively washed three 
times with 100 ul each of NMM solution and three times with 100 ul 10 mM TEAA buffer 
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Imntobilization of primer-extensior, products of Sanger DNA sequencing reacon ,„ r 
mass spectrometry analysis via a trypsin sensitive linkage 

5 D Jf , SeqUe '° n D ' TC mmt,nffle d,Sk5 ° f 8 "™ d ™ elCT 'Millipcre Corp 

Bedford MA, are wetted with 1 0 ul of NMM solution f^methvimorphoHneproparaol- 
2..wa,er; 2/49*49 and a Hnlter arm introduced by reaction w lt h To u, of a ,0 Z 
solution ofl.e-diaminohexane in NMM. The excess diamine is removed bv three 

1 00 u, of NMM solution. Us,„ s standard peptide synthesis protocol 
» wo L-K,,ne residues are attached by two successive condensations w im N-FmocN-tBoc- 
NmT rt 't ° r0Pheny,eS, = r ' Ihe F"oc group is removed wiu, pipeline ,n 

Excess DITC ,s removed by three washing steps w,th 1 00 ul 2-propanol and the N-tBoc 

groups removed with trifiuoroacet.c acid according ,„ standard peptide svnthes, 

process. , „e nucie.c acid primer-extension products are prepared from 

''^nucleotides which cany a prima, amino group a, the SVerminus. The four Santter 

DNA se uenctng reaction mixtures (.50 u, each in Eppendorf babes, are heated for 

a, 70OC to inactivate the DNA p ol ymerase, ethanol precipitated, a,d me pellets 

resus P=nded in 10 ulofa solution of N-methvlmomhoIir,, i „ m , , 
v/v/vi -n,- i '""lyimorpnoline. 2-propanol and water (2/1 0/88 

v/v/v). This solution is transferred to the I m r v mxi- 

ah M ,;„ mi " s Lys " DITC mcmb ™» disks and coupled on 

jzszr a * c Afer dr> ' m8 - ,o ui of nmm — * — - - 

.00 u e a h of NMM solution and three times w.th 100 ul each of ,0 mM TEAA buffer 

Ldlh"' yj"" SPCC,r0n,5trlC bond berween me primer-ex.ens,on produce 

«d fte solid support ,s cleaved by treatment with trypsin under standard conditions td 
^released products ana,yzed by either ES or MALD1 mass spectrometry w.th tr^n 
serving as an internal mass standard. P 

EXAMELL4 

Immobilization of primer-extension products of Sanger DNA sequent reaction for 
mass spectrometric analysis via pyrophosphate linkage * 

■ . k „ . DITC Sequdon me ™brane (disks of 8 mm d.ameter) are prepared as 

described in EXAMPLE 3 and 10 ul of a 10 mM sohno r „?,^L 



" — ^ ^- x -ess penoaate is removed and th~ nnmpr 
extension products of the fou- San^ n\."A r ■ P-imer- 

bans "' DNA sequencing reactions r 1 50 u! each ir 
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Eppendorf tubes) employing nucleic acid primers with a primary ammo group at the 5'- 
end are ethanol precipitated, dissolved in 1 0 ul of a solution of N-methylmorphoiine 2- 
propanol/water (2 10 '88 v/v vi and coupled to the 2' 3-dialdehydo groups of the 
immobilized NAD analog. 
5 The pnmer-extension products are extensively washed with the NMM 

solution (3 times with 100 ul each ) and 10 mM TEAA buffer pH 7.2 (3 times with 100 ul 
each) and the purified primer-extension products are released by treatment with either 
NADase or pyrophosphatase in 10 mM TEAA buffer at pH 7.2 at 37°C for 15 mm. 
lyophilized and analyzed by either ES or MALDI mass spectrometry, the enzymes serving 
10 as internal mass standards. 

EXAMPLE 5 

Synthesis of nucleic acid primers mass-modified by glycine residues at tbe 5 -position 
15 of the sugar moiety of the terminal nucleoside 

Oligonucleotides are synthesized by standard automated DNA synthesis 
using B-cyanoethylphosphoamidites (H. Koster ei ai. Nucleic Acids Res. 12 . 4539 (1984)) 
and a 5'-amino group is introduced at the end of solid phase DNA synthesis (e.g. Agrawal 
ei ai. Nucleic Acids Res. 14 . 6227-45 (1986) or Sproat et al. Nucleic Acids Res. 15. 

20 6181-96 (1987)). The total amount of an oligonucleotide synthesis, starting with 0.25 

umol CPG-bound nucleoside, is deprotected with concentrated aqueous ammonia, purified 
via OligoPAKTM Cartridges (Millipore Corp.. Bedford, MA) and lyophilized. This 
material with a 5'-terminal amino group is dissolved in 100 ul absolute N,N- 
dimethylformamide (DMF) and condensed with 10 (imole N-Fmoc-glycine 

25 pentafluorophenyl ester for 60 mm at 25°C. After ethanol precipitation and 

centrifugation, the Fmoc group is cleaved off by a 10 min treatment with 100 ul of a 
solution of 20% piperidine in N.N-dimethylformamide. Excess piperidine, DMF and the 
cleavage product from the Fmoc group are removed by ethanol precipitation and the 
precipitate lyophilized from 10 mM TEAA buffer pH 7.2. This material is now either 

30 used as primer for the Sanger DNA sequencing reactions or one or more glycine residues 
(or other suitable protected amino acid active esters) are added to create a series of mass- 
modified primer oligonucleotides suitable for Sanger DNA or RNA sequencing. 
Immobilization of these mass-modified nucleic acid primers UP 0 " 1 after pnmer-extension 
during the sequencing process can be achieved as described e n t tY -V^ T 17 < " *~ t 
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EXAM PI F 6 

Synthesis of nucleic acid primers mass-modified at C-5 of the heterocyclic base of a 
pyrimiHine nucleoside with glycine residues 

Starring material was 5-<3-aminopropynyl-l )-3' 5'-di-p-tolyldeoxvuridine 
prepared and 3' S'-de-O-acylated according to literature procedures (Haralambidis a a!.. 
Nucleic Acids Fes jj, 4857-76 (1987)). 0.281 g (1.0 mmole) 5-(3-aminopropynyl-l)-2'- 
deoxyuridine were reacted with 0.927 g (2.0 mmole) N-Fmoc-glvcine 
pentafluorophenylester in 5 ml absolute N.N-dimethylformamide in the presence of 0.129 
g (1 mmole; 174 ul) N.N-diisopropylethylamine for 60 min at room temperature. Solvents 
were removed by rotary evaporation and the product was purified bv silica eel 
chromatography (Kieselge! 60. Merck: column: 2.5x 50 cm. elution with 
chJoroform/'methanol mixtures!. Yield was 0.44 g (0.78 mmole. 78 %). In order to add 
another glycine residue, the Fmoc group is removed with a 20 min treatment with 20% 
15 solution of piperidine in DMF. evaporated in vacuo and the remaining solid material 
extracted three times with 20 ml ethylacetate. After having removed the remaining 
ethylacetate. N-Fmoc-glycine pentafluorophenylester is coupled as described above. 5-(3- 
(N-Fmoc-glycyl)-amidopropynyl-l)-2'-deoxyuridine is transformed into the 5'-0- 
dimethoxytritylated nucleoside-3'-0-J}-cyanoethyl-N,N-diisopropylphosphoamidite and 
20 incorporated into automated oligonucleotide synthesis by standard procedures (H. Kdster 
et a!., Nucleic Acid? Res 12. 2261 (1984)). This glycine modified thymidine analogue 
building block for chemical DNA synthesis can be used to substitute one or more of the 
thymidine/uridine nucleotides in the nucleic acid primer sequence. The Fmoc group is 
removed at the end of the solid phase synthesis with a 20 min treatment with a 20 % 
solution of piperidine in DMF at room temperature. DMF is removed by a washing step 
with acetonitrile and the oligonucleotide deprotected and purified in the standard way. 

EXAMPI F 7 



-5 



iO 



Synthesis of a nucleic acid primer mass-modified at C-5 of the heterocyclic base of a 
pyrimidine nucleoside with C-alanine residues 

Starting material was the same as in EXAMPLE 6. 0.281 g (1.0 mmole) 
5-(3-.Aminopropyny]-l)-2'-deoxyuridine was reacted with N-Fmoc-B-alanine 
pentafluorophenylester (0.955 g. 2.0 mmole) in 5 ml N.N-dimethylformamide (DMF) in 
the presence of 0.129 g H74 u!: ! 0 mmole! \' V.-i^n^n. u.i. <-..-- 



.r.ou,^-; 1,-aian-nL- rnoie:;. car. r>e aaded :n exactly the same way after removal of the Fi 
group. The preparation of the f'-O-dimethoxytntylated nucleoside-S'-O-B-cvanoethvl- 
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N-N-dnsopropylphosphoamidite from 5.(3.,N.Fmoc.B.alanv]).amidopropvnvl.l 
deoxyundine and incorporation into automated oligonucleotide svnthesis is performed 
under standard conditions. Th:s building block can substitute for am 0 f th- 
thymidine/uridine residues in the nucleic acid pnmer sequence. In the case of or«v one 
incorporated mass-modified nucleotide, the nucleic acid pnmer molecules prepared 
according to EXAMPLES 6 and 7 would have a mass difference of]4 dalt0PS 



EXAMPT.F 8 



10 



15 



20 



25 



30 



Synthesis of a nucleic acid primer mass-modified at C-5 of the heterocyclic base of a 
pynmid.ne nucleoside with ethylene glycol monomethyl ether 

As a nucleoside component. 5-(3-aminopropynyl-l )-2-deoxvundine was 
used m this example (see EXAMPLES 6 and 7). The mass-modifv,n g fiinctionalitv was 
obtamed as follows: 7.61 g (100.0 mmole) freshly distilled ethvlene Li ™™..u... 
ether dissolved in 50 ml absolute pyridine was reacted with 10.01 g (100 0 mmolef 
recrvstalhzed succinic anhydride in the presence of 1 .22 s (1 0.0 mmole) 4-N N- 
d^Luninopvridine overnight at room temperature. The reaction was terminated by 
the addition of water (5.0 ml), the reaction mixture evaporated ,„ vacuo, co-evaporated 
twice with dry toluene (20 ml each) and the residue redissolved in 100 ml 

t^ZTT ? e so,utlon was extracted successively ' 10 % ~ --C 

acid (2 x 20 ml) and once with water (20 ml) and the organic phase dned over anhvdrous 
sod.un i sulfate. The organic phase was evaporated in vacuo, the residue redissolved in 50 
ml dichloromethane and precipitated into 500 ml pentane and the precipitate dried /* 
vacuo. Yield was 13.12 g (74.0 mmole; 74 %). 8.86 g (50.0 mmole) of succinvlated 
ethylene glycol monomethyl ether was dissolved in 100 ml dioxane containing 5% drv 
pyridine (5 ml) and 6.96 g (50.0 mmole) 4-nitrophenol and 10.32 g (50.0 mmole) ' 
dicyclohexylcarbodiimide was added and the reaction run at room temperature for 4 hours 
Dicyclohexylurea was removed by filtration, the filtrate evaporated in vacuo and the 
residue redissolved in 50 ml anhydrous DMF. 12.5 ml (about 12.5 mmole 4- 
nitrophenylester) of this solution was used to dissolve 2.81 e U 0 0 mmoH ^ 
^nop rop ynyl-l ) -2'-deoxyaindine. The reaction was performed m the present of 1.01 g 
(10-0 mmole; 1 .4 ml) tnethylamme at room temperature overnight. The reaction mixture 
was evaporated in vacuo, co-evaporated with toluene, redissolved in dichJoromethane and 
cbromatographed on silicagel ;Si60. Merck: column 4x50 cm) with 
dichlorometK?n~ - - 



/ ■ nai ^ f - lJ - °- 0) 15 ~ -O-dimetnoxMntA'iated and 

transformed into the nucleoside- r -0-f<-~ V annethv! v v a-- ■ ■ 
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incorporated as a building bioek in the automated oligonucleotide synthesis according to 
standard procedures. The mass-modified nucleotide can substitute for one or more of the 
thymidine/uridine residues in :he nucleic acid primer sequence. Deprotection and 
purification of the primer oligonucleotide also follows standard procedures. 

5 

E XAMPL E 9 

Synthesis of a nucleic acid primer mass-modified at C-5 of the heterocyclic base of a 
pyrimidine nucleoside with diethylene glycol monomethyl ether 

10 Nucleoside starting material was as in previous examples, 5-(3- 

aminopropynyl-1 )-2'-deoxyuridine. The mass-modifying functionality was obtained 
similar to EXAMPLE 8. 12.02 g (100.0 mmole) freshly distilled diethylene glycol 
monomethyl ether dissolved in 50 ml absolute pyridine was reacted with 10.01 g (100.0 
mmole ) recrystallized succinic anhydride in the presence of 1.22 g (10.0 mmole) 4-N. N- 

15 dimethylaminopyridine (DMAP) overnight at room temperature. The work-up was as 
described in EXAMPLE 8. Yield was 18.35 g (82.3 mmole, 82.3 %). 1 1.06 g (50.0 
mmole) of succinylated diethylene glycol monomethyl ether was transformed into the 4- 
nitrophenylester and. subsequently, 12.5 mmole was reacted with 2.81 g (10.0 mmole) of 
5-(3-aminopropynyl-l)-2 , -deoxyuridine as described in EXAMPLE 8. Yield after silica 

20 gel column chromatography and precipitation into pentane was 3.34 g (6.9 mmole, 69 %). 
After dimethoxytritylation and transformation into the nucleoside-B- 
cyanoethylphosphoamidite, the mass-modified building block is incorporated into 
automated chemical DNA synthesis according to standard procedures. Within the 
sequence of the nucleic acid primer one or more of the thvmidine/uridine residues 

25 can be substituted by this mass-modified nucleotide. In the case of only one incorporated 
mass-modified nucleotide, the nucleic acid primers of EXAMPLES 8 and 9 would have a 
mass difference of 44.05 daltons. 

EXAMPLE 10 

30 

Synthesis of a nucleic acid primer mass-modified at C-8 of the heterocyclic base of 
deoxyadenosine with glycine 

Starting material was N6-benzoy]-8-bromo-5 '-0-(4,4'-dimethox\1xityl)-2 , - 
deoxyadenosine prepared according to literature (Singh et a/., Nucleic Acids Res. 1 8 . 
35 3339-45 (1990)). 532.5 mg (1.0 mmole) of this 8-bromo-deoxyadenosine derivative was 

suspended ; ^ " ^ ^ Ur " " ■ '* ; ' ■ : 

-.^oprop- \c\r.\ iammt: ^::z — * -!\- : -r.i:: \:vj \\\:\~\:vl :;__icl'mu^ materia. ;:ad disappeared 
(4-6 hours) as checked by thin iaver chromatography (TLC). The solvent was evaporated 
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and the residue purified by silica gel chromatography (column 2.5x50 cm) using solvent 
mixtures of chloroform methanol containing 0.1 °/o pyridine. The product fractions were 
combined, the solvent evaporated, the fractions dissolved in 5 ml dichloromethane and 
precipitated into 100 ml pentane. Yield was 487 mg (0.76 mmole, 76 %). Transformation 
5 into the corresponding nucleoside-B-cyanoethylphosphoamidite and mtegration into 

automated chemical DNA synthesis is performed under standard conditions. During final 
deprotection with aqueous concentrated ammonia, the methyl group is removed from the 
glycine moiety. The mass-modified building block can substitute one or more 
deoxyadenosine/adenosine residues in the nucleic acid primer sequence. 

0 

EXAMPLE 11 



Synthesis of a nucleic acid primer mass-modified at C-8 of the heterocyclic base of 
deorv adenosine with glycyiglycine 

15 This derivative was prepared in analogy to the glycine derivative of 

EX-AMPLE 10. 632.5 mg (1.0 mmole) N 6 -Benzoyl-8-bromo-5 , -0-(4.4 , -dimethox\trityl)- 
2'-deoxyadenosine was suspended in 5 ml absolute ethanol and reacted with 324.3 mg (2.0 
mmole) glycyl-glycine methyl ester in the presence of 241.4 mg (2.1 mmole, 366 \±\) 
N, N-diisopropylethylamine. The mixture was refluxed and completeness of the reaction 

20 checked by TLC Work-up and purification was similar to that described in EXAMPLE 
10. Yield after silica gel column chromatography and precipitation into pentane was 464 
mg (0.65 mmole, 65 %). Transformation into the nucleoside-B-cyanoethylphosphoamidite 
and into synthetic oligonucleotides is done according to standard procedures. In the case 
where only one of the deoxyadenosine/adenosine residues in the nucleic acid primer is 

25 substituted by this mass-modified nucleotide, the mass difference between the nucleic acid 
primers of EXAMPLES 10 and 1 1 is 57.03 daltons. 



EXAMPLE 12 



30 Synthesis of a nucleic acid primer mass-modified at the C-2' of the sugar moiety of 
I'-amino-I'-deoxythymidine with ethylene glycol monomethyl ether residues 

Starting material was 5 , -0-(4,4-dimethoxytrityI)-2'-amino-2 , -deox\ r thymidine 
synthesized according to published procedures (e.g., Verheyden et J. Org. Chem . 36 . 
250-254 (1971); Sasaki et al., J Org. Chem . 4L 3138-3143 (1976); Imazawa et al. J Org 
35 Ch£HL 44. 203Q-2O4 1 (\ 0^0 . T ~\ 



mmoiei was reaetec with 2.0 mmoie of the 4-nnrophenyl ester of succinylated ethvlene 
glycol monomethyl ether (see EXAMPLE 8 i in 10 ml drv DMF in the presence 1 ^- 
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-n.de ,140 „„ tnethyiannne for , 8 hours a, room temperature. ^ reactlon mKlm 
evaporated ,„ w co-evapora,ed with tolucne . redlssoK , d ,„ dlchlor<Jnlethane Md 
punfied by s.l.ca ge! chromatography (Si60. Merck; column: 2.5x50 cnv eluenr 
_ chlorofonrvrnethanol mixtures containing 0. 1 % tnethylamine,. The product contains 
> fractions were combined, evaporated and prectpitated mto pentane. Y.eld was 5* mo " 
(U.7j mmol; 73 %). Transformation into the nucleosid=-6-cvanoethyl-N N 
dnsopropylphosphoarnidite and incorporation into the automated chemical DNA svnthesi, 
protocol ,s performed by standard procedures. The mass-modified deoxvd,vmidme 
denvat.ve can substitute for one or more of the thymidme restdues m the nucleic acid 
10 pnmer. u 

die*, , , 'T ana '° e0US Wa> '- b> ' emPl °-™ g * e ^P^y' of succinvlated 
d.ethylene glycol monomethyl ether (see EXAMPLE 9) and methylene glvcol ' 

monomethyl ethe, the corresponding mass-modified oligonucleotides are prepared 1„ the 
case of only one tncorporated mass-modifled nucleoside wtthin the sequence, me mass 

Ks.l and 1j>2.1d daltons respectively. 

EXampi r p 

20 f^f "? r Cl " C Prin " r ™*™™°0 - » -"rnudeotidic Itak ,„e via 
alleviation of phospborothioate groups 

standard n m P H h0Sph ; roth '° att - ro " uini '>8 oligonucleotides were prepared according to 
standard procedures (see e.g. Ga„ „ a,.. Nucleic A rids Ph.. 12 1183 (1991)) One 

* Tdtr Hnka8eS ^ 65 m ° dif,ed ta "* ^ Th = ^ nucleic 

umo, ™ ' ^^CGACQGCCAGT was synthesized in 0 ,5 

La, h °" 2 SyntheS ' 2CT "* °" •* 8-p introduced after the 

fi»l synthes.s cycle (G to T coupling). Suction, deprotectton attd purification 

.0,1.4 nmole phos P horo,h,oa,e groups. Alkylation was performed by dissolving the " 
30 res, ue ,n ,1^, TE buffer (0.01 M Tns pH 8.0. 0.00, M EDTA) and bv addtn „u, of 
a so.ut.on of 20 mM solution of 2-iodoethanol (320 nmole; i.e.. ,0-fold excess w*t 
r spec, ,0 phosphorothioate diesters, tn KN-dunethylformamide (DMF). The alleviated 
ohgonucleotide was purged by sundard reversed phase HPLC (RP-18 Ultrapnere 
Beckman; column: 4.5 x 250 mm; 100 mM tnethylarnmonium acetate, pH 7 0 and a 
35 gradientof5to40%acetonitrile) 



ranger seauencmc rea^no^ 



MX^ST? T 0aUC " 0 '' "* SeqUe " C,nE rMC " OTS - as exemoMed 

m EXAMPLES I - 4. cleaved off the sohd support. ,y„phtl, 2 ed and dissolved ,„ 4 tdeach 
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of TE buffer pH 8.0 and alkylated by addition of 2 ul of a 20 mM solution of 2- 
iodoethanol in DMF. It is then analyzed by ES and or MALDI mass spectrometry. 

In an analogous way. employing instead of 2-iodoethanoL e.g.. ?- 
iodopropanol, 4-iodobutano! mass-modified nucleic acid pnmer are obtained with a mass 
5 difference of 14.03. 28.06 and 42.03 daltons respectively compared to the unmodified 
phosphorothioate phosphodiester-containing oligonucleotide. 



E^MPLE 14 



10 Synthesis of Z'-amino-Z'-deoxyuridine-S'-triphosphate and S'-amino-Z'.S'- 

dideoxythymidine-S'-tripbosphate mass-modified at the 2'- or 3 , -amino function with 
glycine or B-alanine residues 

Staning material was 2'-a2ido-2 , -deoxyuridine prepared according to 
literature (Verheyden et al. J. Org Chem. 26, 250 (1971)). which was 4.4- 

15 dimethoxytritylated at 5-OH with 4,4-dimethoxytrityl chloride in pyridine and acerylated 
at 3'-OH with acetic anhydride in a one-pot reaction using standard reaction conditions. 
With 191 mg (0.71 mmole) 2 , -azido-2'-deoxyuridine as starting material, 396 mg (0.65 
mmol, 90.8 %) 5'-0-(4 ? 4-dimethox>irityl)o'-0-acetyl-2 , -azido-2 , -deoxuridine was 
obtained after purification via silica gel chromatography. Reduction of the azido group 

20 was performed using published conditions (Barta et ai, Tetrahedron 46 . 587-594 (1990)). 
Yield of 5*-0-(4.4-dimethox\irit> r l)-3 , -0-acety , l-2'-amino-2 , -deoxy'uridine after silica gel 
chromatography was 288 mg (0.49 mmole; 

76 %). This protected 2 f -amino-2'-deoxyuridine derivative (588 mg, 1.0 mmole) was 
reacted with 2 equivalents (927 mg, 2.0 mmole) N-Fmoc-glycine pentafluorophenyl ester 

25 in 1 0 ml dry DMF overnight at room temperature in the presence of 1 .0 mmole (1 74 jal) 
N.N-diisopropylethylamine. Solvents were removed by evaporation in vacuo and the 
residue purified by silica gel chromatography. Yield was 71 1 mg (0.71 mmole. 82 %). 
Detritylation was achieved by a one hour treatment with 80% aqueous acetic acid at room 
temperature. The residue was evaporated to dryness, co-evaporated twice with toiuene, 

30 suspended in 1 ml dry acetomtrile and 5'-phosphorylated with POCI3 according to 

literature (Yoshikawa et al. Bull. Chem. Soc. Japan 42. 3505 (1969) and Sowa et al, 
Bull. Che m. Soc. Japan 48. 2084 (1975)) and directly transformed in a one-pot reaction to 
the 5'-triphosphate using 3 ml of a 0.5 M solution (1.5 mmole) tetra (tri-n- 
buryiammonium) pyrophospha:. in DMF according to literature fe e Seela cr n 1 



:;;;x:u:l* ^vaporaiec anc ivopruiizec. h'unlication aiso 
iollowed standard procedures by using anion-exchange chromatography on DEAE- 
Sephadex with a linear u~ad:er* or" rriethvlamrr. i:rr r\ jarb;-r,a:.- ' V' ' V 
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Triphosphate containing fracuons .checked by lhl „ layer chroma.oeraphv on 
polye,„y l e„e,n,,ne ceHuiose plates, were collected, evaporated and Ivophil.zed Vie,d ,„ 
LV-absorbar.ee of the uractl rnotery , was 68%, 0 48 mmole) ' 

5 otai ,k ^ lyCyl " £lyCinera0di,1ed -- Mino --- d - x >™^"e-5-- U -iphospha,e«as 
(>.-9-fiuore„v lm e t h ; loX yca r bo I ,y 1 - g lyc y | > :,a r n, n o.r.deo X vuridi„e bva 0 „e hour " 

s~':r f :,d 0% * - °- - - P ^e :: P h :: 10n of 

Fm 0 *i«h >olue„e a,d subseouen, condensation with V- 
F^ocg K c ,„e pentafluorophenyl ester. Starting , ilh , . 0 ^ oflhe : , N .„ lvcvM . 

Starting w^th 5'-0-(4.4-dimethoxytn^l)-3'-0-acetv!-2'-ammo--- 
aeoxyundine and coupiine with K-Fmnr a a u„- 
5 cnr^^Hi-. ~ „ , . ^O'-B-alarune pentafluorophenyl ester, ,he 

svmhe7" 7 rJ " V, ""'" ,irayl> - ■ ammo - 2 '- deox >^*ne.S'- t riphospha,e can be 

ricineTa 7 ^ Pnn ™ Si °" ™e -ass difference 

the glycine. B-alanme and glycyl-glycine mass-modified nucleosides is ner nucl,„, H 
mcorporated, 58.06. 72.09 and 1,5, dahons respectively. ' ' 

tt ' hensta " i "8»-ith5'-0-(4.4.dim=moxytriryl)-3'-ammo.2'3'- 
d,deo X y,hy midi ne (chained by published procedures, see EXAMPLE P, me 

dki a tenninating nucleotide unit in the Saneer 

DNA sequencing reactions providing a mass difWn. • g 

EXAlyTPf F k 

EXAMPLE 6^ 81 ' (1 '°r i0le) 5 - C3 - Amin °P ro P--y 1 -l)-2'-deox > a m d 1 ne (see 
EXAMPLE 6) was reacted with either 0.927 g f2 .0 mmole^ V.p- ...J 

and the conden^nnnT'"' b ° ,Vents were r «™ved by evaporation m v fl «, 0 

uc t-onaensation products nun fled hv fi^c-k ~k 

P-imiea b> flash chromatography on sihea gel rStill et a! . 
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.1 Org Chem. 42. 2923-2925 (]9~8)). Yields were 476 mg (0.85 mmole: 85%) for the 
glycine and 436 mg (0.76 mmole: ~6%) for the B-aiamne derivatives. For the svnthesis of 
the glycyl-glycine derivative, the Fmoc group of 1 .0 mmole Fmoc-glycine-deoxvundinc 
derivative was removed by one-hour treatment with 20% piperidine in DMF at room 
5 temperature. Solvents were removed by evaporation in vacuo, the residue was co- 
evaporated twice with toluene and condensed with 0.927 g (2.0 mmole) N-Fmoc-eJvcine 
pentafluorophenyl ester and purified as described above. Yield was 445 mg (0.72 mmole; 
72%). The glycyl-. glycyl-glycyl- and 6-alanyl-2'-deoxyuridine derivatives. N-protected 
with the Fmoc group were transformed to the 3-O-acetyI derivatives by tritylation with 
10 4,4-dimethox\triryl chloride in pyridine and acetylation with acetic anhvdride in pyridine 
in a one-pot reaction and subsequently detritylated by one hour treatment with 80% 
aqueous acetic acid according to standard procedures. Solvents were removed, the 
residues dissolved in 100 ml chloroform and extracted twice with 50 ml 1 0% sodium 
bicarbonate and once with 50 mi water, dried with sodium sulfate, the solvent evaporated 
15 and the residues purified by flash chromatography on silica gel. Yields were 36 1 me (0.60 
mmoie; 71%) for the glycyl-. 351 mg (0.57 mmole; 75%) for the B-alanyl- and 323 mc 
(0.49 mmole; 68%) for the glycyl-glycyl-3-0'-aceryl-2'-deoxyuridine derivatives 
respectively. Phosphorylation at the 5'-OH with POCl 3 . transformation into the 5'- 
triphosphate by in-situ reaction with tetra(tri-n-butylarnmomurn) pyrophosphate in DMF, 
20 3'-de-0-acetylation, cleavage of the Fmoc group, and final purification by anion-exchange 
chromatography on DEAE-Sephadex was performed as described in EXAMPLE 14. 
Yields according to UV-absorbance of the uracil moiety were 0.4 1 mmole 5-(3-(N- 
glycyI)-amidopropynyl-l)-2'-deoxyuridine-5'-triphosphate (84%), 0.43 mmole 5-(3-(N-B- 
alanyl)-amidoprop>'nyl-l>2 , -deox\Tiridine-5 , -triphosphate (75%) and 0.38 mmole 5-(3-(N- 
25 glycyl-glycyl)-amidopropynyl- 1 )-2'-deoxyuridine-5'-triphosphate (78%). 

These mass-modified nucleoside triphosphates were incorporated during the 
Sanger DNA sequencing primer-extension reactions. 

VvTien using 5-(3-aminopropynyl-l )-2'.3'-dideoxyuridine as starting material 
and following an analogous reaction sequence the corresponding glycyl-. glycyl-glycvl- 
and B-alanyl-2'.3'-dideoxyuridine-5'-triphosphates were obtained in yields of 69. 63 and 
71% respectively. These mass-modified nucleoside triphosphates serve as chain- 
terminating nucleotides during the Sanger DNA sequencing reactions. The mass-modified 
sequencing ladders are analyzed by either ES or MALDI mass spectrometry. 



30 



synthesis ui H-glycyi- and 8-glycy|-g|ycyl-2 -deox> adenosme-5-triphosphate 

727 mg (1.0 mmolei of N 6 -(4-ten-burviphenoxyaceryl j-8-glvcvl-f-i 4.4- 
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butylphenoxyaceryb-S-gi\xv^^ prepared 
according to EXAMPLES 10 and 1 1 and literature (Roster et al.. Tetrahedron V7. 362 
(1981)) were acerylated with acetic anhydride in pyridine at the 3 '-OH, detriniated at the 
5'-position with 80% acetic acid in a one-pot reaction and transformed into the 5 l - 
5 triphosphates via phosphorylation with POC1; and reaction in-situ with tetra(tri-n- 

butylammonium) pyrophosphate as described in EXAMPLE 14. Deprotection of the N 6 - 
tert-butylphenoxyacery L the 3'-Oacetyl and the O-methyl group at the glycine residues 
was achieved with concentrated aqueous ammonia for ninety minutes at room 
temperature. Ammonia was removed by lyophilization and the residue washed with 
10 dichloromethane, solvent removed by evaporation in vacuo and the remaining solid 

material purified by anion-exchange chromatography on DEAE-Sephadex using a linear 
gradient of triethylammonium bicarbonate from 0.1 to 1.0 M. The nucleoside triphosphate 
containing fractions (checked by TLC on polyethyieneimine cellulose plates) were 
combined and lyophillized. Yield of the S-glycyl^'-deoxyadenosineo'-triphosphate 
15 (determined by UV-absorbance of the adenine moiety) was 57% (0.57 mmole). The yield 
for the S-glycyl-glycyl-Z'-deoxyadenosineo'-triphosphate was 51% (0.51 mmole). 

These mass-modified nucleoside triphosphates were incorporated during 
primer-extension in the Sanger DNA sequencing reactions. 

When using the corresponding N6-(4-tert-butylphenoxyacetyl)-8-glycyl- or - 
20 glycyl-glycyl-5 , -0-(4 ? 4-dimethox^TOty'l)-2 , J-dideoxyadenosine derivatives as starting 
materials prepared according to standard procedures (see, e.g., for the introduction of the 
2\3*-function: Seela et al. Helvetica Chimica Acta 74 1048-1058 (1991)) and using an 
analogous reaction sequence as described above, the chain-terminating mass-modified 
nucleoside triphosphates 8-glycyl- and S-glycyl-glycyl-I'^'-dideoxyadenosine-S'- 
25 triphosphates were obtained in 53 and 47% yields respectively. The mass-modified 
sequencing fragment ladders are analyzed by either ES or MALDI mass spectrometry. 

EXAMPLF 17 

30 Mass-modification of Sanger DNA sequencing fragment ladders by incorporation of 
chain-elongating 2'-deoxy- and chain-terminating 2\3'-dideoxy thymidine-5 f -(alpha- 
S-)-triphosphate and subsequent alkylation with 2-iodoethanol and 3-iodopropanoI 

2',3'-Dideox\thymidine-5 , -(alpha-SHriphosphate was prepared according to 
published procedures (e.g.. for the alpha-S-tnphosphate moiety: Eckstein et al, 
35 Biochemistry If. 1685 (]Q76) and Accounts Che- R?s \2. ?04 no^i and fn- -v. 



penormec accoraing to stancard protocols (.e.g. Eckstein. Ann Rev Biochem. £4. 36" 
(1985) ). When using -\?'-dideox>ihymidine-5'-(alpha-S)-tnphosphates. this is used 



WO 94/16101 



-38 



PCT/US94 00193 



instead of the unmodified Z^'-dideoxxthymidineo'-triphosphate in standard Sanger DNA 
sequencing fsee e.g. Swerdlow et al. m Nucleic Acids Res. 1 S. 1415-1419 (1990V). The 
template (2 pmole) and the nucleic acid M13 sequencing pnmer (4 pmole) modified 
according to EXAMPLE 1 are annealed by heating to 65°C in 100 ul of 10 mM Tns-HCl 
5 pH 7.5, 10 mM MgCK 50 mM NaCL 7 mM dithiothreitol (DTT) for 5 min and slowly 
brought to 37°C during a one hour penod. The sequencing reaction mixtures contain, as 
exemplified for the T-specific termination reaction, in a final volume of 150 ul, 200 uM 
(final concentration) each of dATP. dCTP. dTTP, 300 uM c7-deaza-dGTP, 5 uM 2.3'- 
dideox\lhymidine-5 ! -(alpha-S)-triphosphate and 40 units Sequenase (United States 

10 Biochemicals). Polymerization is performed for 10 min at 37°C. the reaction mixture 
heated to 70°C to inactivate the Sequenase. ethanol precipitated and coupled to thiolated 
Sequelon membrane disks (8 mm diameter) as described in EX-AMPLE 1 . Alkylation is 
performed by treating the disks with 10 ul of 10 mM solution of either 2-iodoethanol or 3- 
iodopropanol in NMM (N-methylmoipholine/water/2-propanoI. 2/49/49, v/v/v) (three 

15 times), washing with 10 ul NMM (three times) and cleaving the alkylated T-terminated 
primer-extension products off the support by treatment with DTT as described in 
EX-AMPLE 1 . Analysis of the mass-modified fragment families is performed with either 
ES or MALDI mass spectrometry. 

20 EXAMPLE 1 $ 

Analysis of a Mixture of Oligotbymidylic Acids 

Oligothymidylic acid, oligo p(dT)j2-18> * s commercially available (United 
States Biochemical, Cleveland. OH). Generally, a matrix solution of 0.5 M in ethanol was 
25 prepared. Various matrices were used for this Example and Examples 19-21 such as 3,5- 
dihydroxybenzoic acid, sinapinic acid, 3-hydroxypicolinic acid, 2.4,6- 

trihydroxyacetophenone. Oligonucleotides were lyophilized after purification by HPLC and 
taken up in ultrapure water (MilliQ. Millipore) using amounts to obtain a concentration of 10 
pmoles/jj.1 as stock solution. .An aliquot (1 of this concentration or a dilution in ultrapure 
30 water was mixed with 1 p.1 of the matrix solution on a flat metal surface serving as the probe 
tip and dried with a fan using cold air. In some experiments, cation-ion exchange beads in 
the acid form were added to the mixture of matrix and sample solution. 

MALDI-TOF spectra were obtained for this Example and Examples 19-21 on 
different commercial instruments such as Vision 2000 (Tinnigan-MAT). VG TofSpec fFison^ 

venerate:: • '•:n_nw-:; ::: TJ. . - Ma^ calibration v> a< aone externally ana generals 

acnieved by using defined peptides of appropriate mass range such as insulin, gramicidin S. 
tripsinogen, bovine serum albumen, and rviochrome r -\!! ^rcz:.r:i wcrv r-:' .: v -- 
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empio> iriLZ a nitrogen laser w:th 5 nsec pulses at a wavelength of 53 7 nm. Laser energy 
varied between ! 0^ and 1 1' YV cm-. To improve signal-io-noise ratio generally, the 
intensities of 10 to 30 laser shots were accumulated. 

EXAMPLE 19 

Mass Spectrometric Analysis of a 50-mer and a 99-mer 

Two iarge oligonucleotides were analyzed by mass spectrometry. The 50-mer 
d (TAACGGTCATTACGGCCATTGACTGTAGGACCTGCATTACATGACTAGCT) i SEQ 
ID NO;3 ) and dT(pdT)QQ were used. The oligodeoxynucleotides were synthesized using P 
-cyanoethylphosphoamidites and purified using published procedures. (e.g. N.D. Sinha. J. 
Biernat. J. McManus and H. Roster. Nucleic Acids Res.. 12. 4539 (1984)) employing 
commercially available DNA synthesizers from either Millipore (Bedford. MA) or Applied 
Biosy stems (Foster City. CA) and HPLC equipment and IIP 18 reverse phase columns from 
waters (Miiford. MA). The samples for mass spectrometric analysis were prepared as described 
in Example 18. The conditions used for MALDI-MS analysis of each oligonucleotide were 500 
fmol of each oligonucleotide, reflectron positive ion mode with an acceleration of 5 kV and 
postacceleration of 20 kY. The MALDI-TOF spectra generated were superimposed and are 
shown in FIGURE 15. 

EXAMPLE 20 

Simulation of the DNA Sequencing Results of FIGURE 2 

The 13 DNA sequences representing the nested dT-terminated fragments of the 
25 Sanger DNA sequencing for the 50-mer described in Example 19 (SEQ ID NO:3) were 
synthesized as described in Example 19. The samples were treated and 500 fmol of each 
fragment was analyzed by MALDI-MS as described in Example 18.. The resulting MALDI- 
TOF spectra are shown in FIGURES 16A-16M. The conditions were reflectron positive ion 
mode with an acceleration of 5 kV and postacceleration of 20 kV. Calculated molecular masses 
30 and experimental molecular masses are shown in Table 1 . 

The MALDI-TOF spectra were superimposed (FIGURES 17A and 17B) to 
demonstrate thai the individual peaks are resolvable even between the 10-mer and 1 1-mer (upper 
panel) and the 37-mer and 38-mer (lower panel). The two panels show two different scales and 
the spectra analyzed at that scale. 
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EXAMPLE 21 

MALDI-MS Analysis of a Mass-Modified Oligonucleotide 

A 1 7-mer was mass-modified at C-5 of one or two deoxyuridine moieties. 5 - [ 1 3 - 
C2-MethoxyethoxyI)-tridecyne-l -yl]-5 , -0-(4.4'-dimethox\nri 

N\ N-diisopropylphosphoamidite was used to synthesize the modified 1 7-mers using the methods 
described in Example 19. 

The modified 1 7-mers were 

X 

a: d (TAAAACGACGGCCAGUG) (molecular mass: 5454) 
(SEQ IDNO:4) 

X X 

! ! 

b: d (UAAAACGACGGCCAGUG) (molecular mass 5634) 
(SEQ IDNO:5) 

where X = -C=C-(CH 2 )n-OH 

(unmodified 1 7-mer: molecular mass: 5273) 



The samples were prepared and 500 fmol of each modified 1 7-mer was 
analyzed using MALDI-MS as described in Example 18. The conditions used were 
25 reflectron positive ion mode with an acceleration of 5 kV and postacceleration of 20 kV. 
The MALDI-TOF spectra which were generated were superimposed and are shown in 
FIGURE 18. 

All of the above-cited references and publications are hereby incorporated by 

30 reference. 



EQUIVALENTS 
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Those skilled in the art will recognize, or be able to ascertain using no more 
than routine experimentation, numerous equivalents to the specific procedures described 

herein ^nch ^mMV^ntr -i*-,. — j * 
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(i) APPLICANT: 

(A) NAME: KOSTER, HUBERT 

(B) STREET: 15 4 0 MONUMENT STREET 

(C) CITY: CONCORD 

10 (D; STATE: MASSACHUSETTS 

{ E ) COUNTRY: USA 

( F ) POSTAL CCOZ (ZI?: : CI 742 

(G) TZLEPHC:CE : (506- 369-9790 

15 (ii) TITLE OF INVENTION : UNA SEQUENCING BY MASS SPECTROMETRY 

(iii) NUMBER OF SEQUENCES: 5 

(v) COMPUTER READABLE FORK : 
20 (A) MEOIUT-* TYPE: Flcppy disk 

(B) COMPUTER: I3M PC compatible 

(C) OPERATING SYSTEM: PC -DOS / MS-DOS 

(D) SOFTWARE: ASCII (text) 

25 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER : 

(B) FILING DATE: 06- JAN - 1 9 9 4 

(C) CLASSIFICATION: 

30 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/001,323 

(B) FILING DATE: 07-JAN-1993 

(C) CLASSIFICATION: 1807 

35 (viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: DeConti , Giulio A. 

(B) REGISTRATION NUMBER: 31,503 

(C) REFERENCE /DOCKET NUMBER: HKI - 0 0 3 CP 

40 dx) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 227-7400 

(B) TELEFAX: (€17) 227-5 941 



45 (2) INFORMATION FOR SEC ID NO : 1 : 

(l) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 14 base pairs 
(E) TYPE : nucleic acid 
50 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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CATGCCATGG CA.TG 

(I? INFORMATION FIR cr ~ — :; ~ . _ _ 

(iJ SEQUENT CK^ZZZK Z3ZZZS : 
--vj.-: ^_ sase pairs 
fBJ ,:'?£; nucleic acid 
(O STRANDEDNESS : single 
(D) TOPOLOGY: linear 

1 - - - : ether nucieic acid 
(iii) HYPOTKET'CA" • 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
AAATTGTGCA CATCCTGCAG C 
(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CXIS^CZZRISTZCS : 

(A) LENGTH: 5C base nairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: orher nucleic acid 
(iii) HYPOTHETICAL: YES 



35 

(Xij SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

TAACGGTCAT TACGGCCATT GACTGTAGGA CCTGCATTAC ATGACTAGCT 
40 50 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS - 
4S fA) LENGTH: 17 base cairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

5Q (ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: YES 
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(2) INFORMATION FOR SEC II NO : 5 

(l) SEQUENCE CrS^FJ^CZZKZSrZCS : 
(A) LENGTH: 1" base pairs 
(3} TYPE: nucleic acid 

(C) STRANTELNE3S : single 

(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: other nucleic acid 
(ill) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 
XAAAACGACG GGCC^GXG 
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1 ■ A method of sequencing a nucleic acid, comprising the steps of 

a) synthesizing complementary nucleic acids which are complements to the 
nucleic acd to be sequenced, starting from a nucleic acid primer and in the 
presence of chain-terminating and chain-elongating nucleotides so as to 
produce four sets of base-specifically terminated complementary, nucle.c acd 
fragments; 

b) defining the molecular weigh, value of each nested fratrment in each of 
me four sets of base-specifically terminated fragments by mass spectrometrv 
wherem the molecular weight values of at ieast mo base-specificallv 
terminated fragments are determined concurrently and 

O determining the nucleotide sequence bv aligning 'the four sets of molecular 
- -vvvwiuK lu njuiecuiar weieiu. 

■ermi„ 3 ^7 eth ° d " C ' aim "* f ° Ur Se,s ° f "-cspecificallv 

values by mass spectrometry. * 

3. The method according to claim 2, wherem the four sets of base-specificallv 
terminated fragments are purified, comprising the steps of ? ' 

a) immobilizing the complementary nucleic acids on a solid support- and 

b) washing out all remaining reactants and by-products. 



25 4. 



30 



comoJ 1 * mt,h0d , aCCMdin - e M data 3 ' toher comprising me step of removin, the 
complementary nucletc acids from the solid support. 

backh The fT th °' i aCCMdin8 10 C ' aim '• " herein 3 COU, " er - iOT of «* P^^hate 

it e ;r; h ;; omp erasmar> ' nuc,e,c K,ds is ™ •* » ^ • — 

6. The method according to claim 1. wherem each of me four sets of base-specificallv 
termmated fragments is synthesized in a separate reaction vessel s P«'f'"'» 



i:>^ interns: 



, ;. ; "' ie ^° ;aun g * e nodular weight values determined fo- ea-b 

of me tour sets of base-sp S =ificaIiy terminated fragments. 
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8. The method according to claim 1. wherein at least two of the four sets of base- 
specificallv terminated fragments are synthesized concurrently m the same reaction vessel. 

9. The method according to claim 8, wherein the chain-terminating nucleotides are 
5 chosen such that addition of one species of the chain-terminating nucleotides to the 

complementary nucleic acid can be distinguished by mass spectrometry* from addition of 
all other species of the chain-terminating nucleotides present in the same reaction vessel. 

10. The method according to claim 1. wherein the molecular weight value of each 
10 nested fragment are determined by matrix-assisted laser desorption/ionization mass 

spectrometry- (MALDI-MS). 

1 1 . The method according to claim 1 in which the molecular weight value of each 
nested fragment are determined by electrospray mass spectrometry (ES-MS). 

15 

12. The method according to claim 1, wherein the complementary nucleic acid is 
synthesized using a nucleic acid primer; at least one deoxynucleotide selected from the 
group consisting of deoxyadenosine triphosphate dATP, deoxythymidine triphosphate 
dTTP, deoxyguanosine triphosphate dGTP, deoxycytidine triphosphate dCTP, 

20 deoxyinosine triphosphate dITP, a 7-deazadeoxynucleoside triphosphate c^dGTP, a 7- 
deazadeoxynucleoside triphosphate c ; dATP, and a 7-deazadeoxynucleoside triphosphate 
c^dlTP; at least one chain-terminating dideoxynucleotide selected from the group 
consisting of dideoxyadenosine triphosphate ddATP, dideoxythymidine triphosphate 
ddTTP, dideoxyguanosine triphosphate ddGTP, and dideoxycytidine triphosphate ddCTP; 

25 and a DNA polymerase. 

13. The method according to claim 1, wherein the complementary nucleic acid is 
synthesized using a nucleic acid primer; at least one nucleotide selected from the group 
consisting of adenosine triphosphate ATP. uridine triphosphate UTP, guanosine 

30 triphosphate GTP. cytidine triphosphate CTP, inosine triphosphate ITP, a 7- 

deazanucleoside triphosphate c^ATP. a 7-deazanucleoside triphosphate c^GTP, and a 7- 
deazanucleoside triphosphate c'ITP; at least one chain-terminating 3'-deoxynucleotide 
selected from the group consisting of deoxyadenosine triphosphate 3' -dATP, deoxyuridine 
triphosphate 3'-dLTP, deoxyguanosine triphosphate 3'-dGTP, and deoxycytidine 

35 triphosphate 3'-dCTP; and an RNA polymerase. 



::'ir.u\_ ou;. , _ . rev er^oi v immoDwizinc tne primer on a solid suDDor: 
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15. The method according to claim 14 whpr^m tu 

Z£ " K ~ - ^ — » - IKE the 

16 



eavable 
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The method according to claim 15. wherein the temporary and cl 
attach™, can be Ceaved en^maticalK, chemicallv or ph^cl 

- -g eiectroir ^T^^^^""} ' - ~ 

^ • bond, an arginme/a^n^ H 3 

Pyrophosphate bond, and a bond created * ^^ST ^ ' 

The method according to claim l * „,u« • L 
•erminated fragments are d^^^f T""^ 
products from the sequencing reaction. ' reraa ' m " g ™ " d *- 

19- TT,e method according ,o claim 18. wherein the base-specificallv „ 
laments are Ceaved from the solid suppon pri0 r ,„ mass spec'met. 

20. The method accordmg to clatm 18. wherein the base-specificallv ,„w „ 
fragments are Ceaved from me so.id support during mass spec,^ ^ 

■erminattng nucleotides, where.n one of he « ; of t * ^ "** ^ ChaM " 
» modified and the other sets of ZT r "^V'™* '=™»>ated fragments 
modified. „d each «*JZ1£^TT« ^ " ™ 
- -rence to be dtsttngutshed from' ^^^3^ 2 ^ 

~. The method according to claim "M • 

modified base-sp-cificallv ,™ " °" e ° f ,he sets ° f "ass- 

funcona,,. ^^S^T! - ^ * ™ di « • ™ djf v, n . 



25 21 



WO 94/16101 -47- PCT-US94/00193 

23. The method according to claim 22. wherein the heterocyclic base-modified 
nucleotide is selected from the group consisting of a cytosine nucleotide modified at C-5. 
a thymine nucleotide modified at C-5. a thymine nucleotide modified at the C-5 methvl 
group, a uracil nucleotide modified at C-5. an adenine nucleotide modified at C-8. a c 7 - 
5 deazaadenine modified at C-8. a c 7 -deazaadenine modified at C-7. a guanine nucleotide 
modified at C-8. a c ; -deazaguanine modified at C-8. a c 7 -deazaguanine modified at C-7. a 
hypoxanthine modified at C-8. a c~-deazahypoxanthine modified at C-7. and a c 7 - 
deazahypoxanthine modified at C-8. 

10 24. The method according to claim 2 1 . wherein at least one of the sets of mass- 
modified base-specifically terminated fragments is modified with a mass-modifying 
functionality (M) attached to one or more phosphorus atoms of the internucleotidic 
linkages of the fragments. 

15 25. The method according to claim 21. wherein at least one of the sets of mass- 
modified base-specifically terminated fragments is modified with a mass-modifyine 
functionality (M) attached to one or more sugar moieties of nucleotides within the set of 
mass modified base-specifically terminated fragments at at least one sugar position 
selected from the group consisting of an internal C-2' position, an external C-2' position. 

20 and an external C-5' position. 

26. The method according to claim 21, wherein at least one of the sets of mass- 
modified base-specifically terminated fragments is modified with a mass-modifying 
functionality (M) attached to the sugar moiety of a 5'-terminal nucleotide and wherein the 

25 mass-modifying function (M) is the linking functionality (L). 

27. The method according to claim 21, wherein a mass-modifying functionality (M) is 
attached to a set of base-specifically terminated fragments subsequent to enzymatic 
synthesis of the base-specifically terminated fragments and prior to determining the 
molecular weight values for the nested fragments by mass spectrometry. 



30 



28. The method according to claim 27, wherein the synthesis of the base-specifically 
terminated fragments is performed by using at least one reagent selected from the group 
consisting of a nucleic acid primer, a chain-elongating nucleotide, a chain-terminating 
35 nucleotide or a tag probe which has been modified with - 1 ?r^u^- 



spectrometry ana.> 
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29. The method according to claim 21. wherein mass differentiation of the tao pro be^ 
is achieved by changing the nucleotide composition of a: least one of the tac nrobes and 
complementary tag sequence in the species of nucleic ac;d. 

5 30. The method according to claim 2 1 . wherein the tag probes are covalentl v bound to 
the corresponding complementary tag sequence pnor to mass spectrometry analysis. 

31. The method according to claim 30. wherein binding between the tac probes and the 
corresponding complementary tag sequences is achieved photochemical via 
10 photoactivatable groups. 



32. 



A method of sequencing a nucleic acid, comprising the steps of 

a) reversibly linking an oligonucleotide pnmer to a solid support through a 



15 b) 



20 c) 



synthesizing complementary nucleic acids which are complementary to th- 
nucleic acid to be sequenced, starting from a nucleic acid primer and in the 
presence of chain-terminating and chain-elongating nucleotides so as to 
produce four sets of base-specifically terminated complementary nucleic acid 
fragments; 

determining the molecular weight value of each nested fragment in each of the 
four sets of base-specifically terminated fragments by matrix assisted laser 
desorption/tonization mass spectrometry wherein the molecular weight values 
of at least two base-specifically terminated fragments are determined 
concurrently and wherein the nested fragments are cleaved from the solid 
support by a laser during mass spectrometry; and 
d) determining the nucleotide sequence by aligning the four sets of molecular 
weight values according to molecular weight. 
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33. A method of multiplex analysis of nucleic acid sequences, comprising the steps of 

a) reversibly linking a nucleic acid primer to a solid support through a linkinc 
group: 

b) synthesizing complementary nucleic acids which are complementary to the 

5 nucleic acid to be sequenced, starting from the nucleic acid primer and in the 

presence of chain-terminating and chain-elongating nucleotides so as to 
produce four sets of base-specifically terminated complementary nucleic acid 
fragments; 

c) determining the molecular weight value of each nested fragment in each of the 
10 four sets of base-specifically terminated fragments by matrix assisted laser 

desorption/ionization mass spectrometry wherein the molecular weicht values 
of at least two base-specifically terminated fragments are determined 
concurrently and wherein the nested fragments are cleaved from the solid 
support by a laser during mass spectrometry; and 
15 d) determining the nucleotide sequence by aligning the four sets of molecular 

weight values according to molecular weight: 
wherein at least one reagent selected from a group consisting of, a nucleic acid 
primer, a chain-elongating nucleotide, or a chain-terminating nucleotide is mass-modified, 
wherein each set of base-specifically terminated fragments has a sufficient mass difference 
from the other sets of base-specifically terminated fragments so as to be unique, and wherein 
the molecular weight values of the nested fragments of two or more sets of unseparated base- 
specifically terminated fragments are determined concurrently. 



20 



WO 94/16101 ^ m <( , 

"- L " PCT1'S94/00193 

34. A kit for sequencing one or more species of nucleic acids bv multiplex specrrometrr 
nucleic acid sequencing, comprising: 

a) a soiid support having a linking functionality (L') : 

b) a set of nucleic acid primers suitable for initiating svmhesis of a set of 
complementary nucleic acids which are complementary to the different 
species of nucleic acids, the primers each including a linking erou P (L ) able 
to interact with the linking functionality (L') and reversibly link the primers 
io the solid support: 

c) a set of chain-elongating nucleotides for synthesizing the comolementarv 

JU nucleic acids; 

d) a set of chain-terminating nucleotides for terminate svmhesis of the 
complementary nucleic acids and generating sets of base-specific terminated 
complementary nucleic acid fragments: and 

e) a polymerase for synthesizing the complementary- nucleic acids from rh. 
nucleic acid pnmers. chain-elongating nucleotides and terminate 
nucleotides. 

wherein a, least one reagent selected from the group consisting of the primers the 
Cham-elongating nucleotides, and the chain-terminating nucleotides is mass ' 
modtfied to provide dtsttnction between each se, of bas.-specificaHv terminated 
nucleotides of each species of nucleic acid by mass spectrometry. 

35. A solid support chosen from the group consisting of magnetic beads, cellulose beads 

SEP^KV ^ COn,r0 " ed 0,255 (CPGX SilKa - 8£l SE ™ S * bJL 

SEPHADEX beads, capillaries, polymeric sheets of polyethylene, polvmeric sheets of 

polypropylene, polymeric sheets of polyamide. polymeric sheets of polveste, polvmeric 

*» of polyvmytidene-difiuoride, gl ass plates, and meta, surfaces the solid supp^ Ling 

e tr b v^rr whkh is abic 10 wi,h a «™* L ' ° f » 
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the „ 5 SUPP ° n aCCOrdmg ,0 Clam 35 ' WhCTei " link ^ L "L'. » -lected from 
the group cons.sttng of a photocleavable bond, a bond based on s,ron e electrostatic 

mteracon. a mtylether bond, a B-benzoylpropionyl group, a lev.linyl gr0U p. a disulfide 

3, ^l^^c'™^ * l>,S,ne ' ' S '" e b °" d - 3 PTOPhOSpha " *»* ^ • ^ 

a creaxea pv Watson-CncV 



-omp„ s ,„g a so h a su P p„n of claim -5 in each we,, for reversible binding a pnmer 
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38. A set of mass-modified nucleic acid primers selected from a group consisting of a 
collection of mass-modified universal primers for priming DNA synthesis, and a collection of 
mass-modified initiator oligonucleotides for initiating transcriptional RNA svnthesis. 

5 39. The set of mass-modified nucleic acid primers according to claim 38, wherein at least 
one of the mass-modified primers is modified with a mass modifying functionality (M) at one 
or more heterocyclic bases within the primers. 

40. The set of mass-modified nucleic acid primers according to claim 39. wherein at least 
10 one of the mass modified primers comprises at least one heterocyclic base-modified 

nucleotide selected from the group consisting of a cytosine nucleotide modified at C-5. a 
thymine nucleotide modified at C-5. a thymine nucleotide modified at the C-5 methvl group, 
a uracil nucleotide modified at C-5. an adenine nucleotide modified at C-8, a c^-deazaadenine 
modified at C-8. a c^-deazaadenine modified at C-7, a guanine nucleotide modified at C-8. a 
15 c^-deazaguanine modified at C-8, a c^-deazaguanine modified at C-7, a hypoxanthine 
modified at C-8. a c^-deazahypoxan thine modified at C-7. and a c^-deazahypoxanthine 
modified at C-8. 

41 . The set of mass-modified nucleic acid primers according to claim 39, wherein at least 
20 one of the mass-modified primers is modified with a mass-modifying functionality (M) 

attached to one or more phosphorus atoms of the internucleotidic linkages within the mass 
modified primer. 

42. The set of mass-modified nucleic acid primers according to claim 39, wherein at least 
25 one of the mass-modified primers is modified with a mass-modifying functionality (M) 

attached to at least one sugar moiety of the nucleotides within the mass-modified primer at at 
least one sugar position selected from the group consisting of an internal C-2' position, an 
external C-2' position, and an external C-5' position. 

30 43. The set of mass-modified nucleic acid primers according to claim 39, wherein at least 
one of the mass-modified primers is modified with a mass-modifying functionality (M) 
attached to the sugar moiery of a 5'-terminal nucleotide of the primer, and wherein the mass- 
modifying function (M) is the linking functionality (L). 

35 44. A set of mass-modified nucleotides selected from the trmur — — 

- — ■ ^::::;;^uni: . > - >vntnes:> 

"loamec nuCiL-L'MLic: inpnospnates suiiaDie lor RNA synthesis, and mass-modified 
3"-deoxynucleoside triphosphates suitable for chain-terminating RNA synthesis. 
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45 The set of mass-modified nucleotides according to claim 44. wherein a r-«- 
nTctotld" fUnCUOna:i 15 anaChed 10 3 heter0CydlC b3Se ° fth < ™ass-modifi7d" 

46. The set of mass-modified nucleotides according to claim 45, wherein the mass- 
modified nucleotide composes a modified heterocyclic base selected from the .roup 
consisting of a cosine moiety modified at C-5. a thymine moietv modified at a 
hymme moiety modified at the methyl group of C-5. a uxaci. moJerv modified at C- an 
a erune mo.ety modified at C-8. a ^-deazaadenine ^ modif]ed ^ ^ 
c -deazaadenine moiety modified at C-7. a guanine moiety modified at C- 8 " a c? 
deazaguanine motety modified at C-8, a c ^guanine moiety modified at C-7 a 
ypoxantn.ne moiety modified at C-8. a c^ypo.xanthine mo.ety modified at C-8 and a 
c '-deazahypoxamh.ne moiety modified at C-7. 

° f mass - modifi ^ ""leoddc according to claim 44. where,* a mass- 

48. The set of mass-modified nucleotides according to claim 44, wherein the mass 
modified nucleotide compnses a deoxynucleoside triphosphate, and a massl! ZZ 

functional tv CM) is attache t~ * n ->» . moairymg 

diphosphate. " P ° Sm0n ° f 3 «** <-vnucleos.de 



25 49. 



The set of mass-modified nucleotides according to clatm 44, wherein the mass 
modtfied nudeotide compnses a dideoxynucleoside triphosphate and a uZZ^L 

consisting of a C-2 position and a C-3 1 position. " 

30 50. A s=, of mass-differentiated tag probes complement, bv Wa.son-Cnck base 
= tag sequences present a, ieas, one se, of base-spec, fi ca„y te^ 

Lr^:^ m ^ "* ^ » «- 50. where, mas. 
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,„ c ta8 Probe nuceo,;::;::; - pr ° be * ° nc ° r ™ ^ 

5 53. These, of mass-differemiated tag probes accordine to claim " wh, • >, 

comprises a, leas, one mass-modified heterocvdic base seC ^ "* ^ 

8 ra0 '^ ™^ - C-5. a thymine moietv mod^ I , c 5 ^ C °" S ' S,mS °'" 

modified at the C-5 m^thvi „ • IrlMal thymine moietv 

imeL 3 metlnl group, a uracil moien modified at r < „„ 
modified a, C- 8 , a c7-deazaade mn e mo.etv modified at C 8 a c 7 d 
0 modified a, C-7, a gua^ne mo ierv modified a, C-7 a c 7 de " "'^ 

C-8, a c7-deaza g uanine moietv modified at C 1 al "^S^ ""^ m ° dified * 
< Wahy„„e ^^^cT^^™^^ « «. a 
at C-7. Qac dea2 ahypoxanthine moiety modified 



^ntemucleotidic linka 2e of at l M « ... ,„„ " ° ^ phos P h °™ atoms of an 



' ^ ~*iww iu unc ur 

mtemucleotidic linkage of at leas, one tag probe. 

55. 



55. The set of mass-differentiated tag probes accordin. ,o claim 5. «* • . 

^^^^ 
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59. The set of mass-modified nucleic acid primers according to claim 39. wherein the 
mass-modifying functionality (M : is generated from a precursor functionality (PFi anached 
to the mass-modified primers, the precursor (PF) selected from a group consisting of -N5 and 
XR. wherein R is H and X is selected from a group consisting of -OH. -Nl-K -NHR. -SH. 

5 -NCS. -OCOf CH2VCOOH (where r = 1 -20). -NHCO(CH2) r COOH (where r = 1 -20 >. 
-OSO2OH. -OCOCCH2V 1 inhere r = 1-20). and -OP(0-Alkyl)N< AlkylK 

60. The set of mass-modified nucleotides according to claim 45. wherein the mass- 
modifying functionality (M) is selected from a group consisting of F, CI. Br. I. SnCFh 13. 

10 Si(CH 3 )2(C2H 5 ). SifCH3)(C : H 5 )2. Si(C2H 5 ) 3 , CH 2 F, CHF 2 . and CF 3 . 

61. The set of mass-modified nucleotides according to claim 45. wherein the mass- 
modifying functionality (M) is generated from a precursor functionality (PF) attached to the 
mass-modified nucleotides, the precursor (PF) selected from a group consisting of -N3 and 

15 XR, wherein R is H and X is selected from a group consisting of -OH, -NTb, -NHR. -SH, 
-NCS, -OCO(CH2) r COOH (where r = 1-20), -NHCO(CH2) r COOH (where r = 1-20). 
-OSCbOH. -OCO(CH2) r I (where r = 1-20), and -OP(0-Alkyl)N(Alkyl) 2 . 

62. The set of mass-differentiated tag probes according to claim 5 1 , w T herein the tag 

20 sequence is mass-modified with a mass-modifying functionality (M) selected from a group 
consisting of XR, F. CI Br, I. Si(CH 3 ) 3 , Si(CH 3 )2(C 2 H 5 ), Si(CH 3 )(C2H 5 )2, Si(C2H 5 ) 3 , 
CH7F. CHF2, and CF 3 , wherein X is selected from a group consisting of -OH, -NT-b- -NHR. 
-SH, -NCS. -OCO(CH 2 ) r COOH (where r = 1-20), -NHCO(CH 2 ) r COOH (where r = 1-20). 
-OS020H. -OCO(CH 2 ) r I (where r = 1-20). and -OP(0-Alkyl)N( 'Alkyl)2, and R is selected 

25 from a group consisting of H. methyl, ethyl, propyl, isopropyl, t-butyl. hexyl, benzyl, 

benzhydr>*l, trityl, substituted trityl. aryl, substituted aryl. polyoxymethylene. monoalkylated 
polyoxymethylene, a polyethylene imine. a polyamide of the general formula 
(-NH(CH2) r NHCO(CH2)rCO-) m . a polyamide of the general formula <-NH(CH 2 ) r CO-) m . a 
polyester of the general formula (-0(CH2) r CO-) m , an alkylated sihi compound of the 

30 general formula -Si( Y 13. a heterooiigo'polyaminoacid of the general formula 
(-NHCHaaCO-) m . a polyethylene glycol of the general formula 

-(CH2CH20) m -CH2CH 2 OH. and a monoalkylated polyethylene glycol of the general 
formula -(CH 2 CH20) m -CH2CH20-Y, where m is in the range of 0 to 200. Y is a lower 
alky] group selected from a group consisting of methyl, ethyl, propyl, isopropvl. t-hurv!. 
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63. The set of mass-differentiated tag probes according to claim 51. wherein the mass- 
modifying functionality (Mj is generated from a precursor functionality (PF) attached to the 
mass-differentiated tag probes, the precursor (PF) selected from a group consisting of -N- 
and XR. wherein R is H and X is selected from a group consisting of -OH. -NFK -NHR. 

5 -SH. -NCS.-OCO(CH2) r COOH (where r= 1-20). -NHCOfCH 2 ) r COOH (where r = 1-20). 
-OS0 2 OH. -OCO(CH2) r I (where r = 1-20), and -OP(0-Alkyl)N< Alkyl)2- 

64. The set of mass-modified nucleic acid pnmers according to claim 39, wherein the 
mass-modifying functionality (M) is given by the general formula XR in which X is selected 

10 from a group consisting of -OH. -NH2- -NHR. -SH. -NCS, -OCO(CH 2 ) r COOH (where r = 1- 
20), -NHCO(CH 2 ) r COOH (where r = 1-20). -OS0 2 OH. -OCO(CH 2 ) r l (where r = 1-20). and 
-OP(0-AlkyI)N(Alkyl)2. and R is selected from a group consisting of H, methyl, ethyl, 
propyl, isopropyl. t-butyl. hexyl. benzyl, benzhydryl. trityl. substituted trityL aryl. substituted 
aryh polyoxymethylene. monoalkylated polyoxymethylene, a polyethylene imine, a 

15 pclyamidc of the general formula (-NH(CH2) r NHCO(CH2) r CO-) m , a polyamide of the 

general formula (-NH(CH2) r CO-) m , a polyester of the general formula (-0(CH2) r CO-) m , an 
alkylated silyl compound of the general formula -Si(Y)3. a heterooligo/polyaminoacid of the 
general formula (-NHCHaaCO-) m , a polyethylene glycol of the general formula 
-(CH2CH20) m -CH2CH20H, and a monoalkylated polyethylene glycol of the general 

20 formula -(CH 2 CH 2 0) m -CH2CH20-Y, where m is in the range of 0 to 200, Y is a lower 
alkyl group selected from a group consisting of methyl, ethyl, propyl, isopropyl, t-butvl, 
hexyl, r is in the range of 1 to 20, and aa represents the amino acid side chain of a naturally- 
occurring amino acid. 
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65. The sei of mass-modified nucleotides according to claim 45. wherein the mass- 
modifying functionality (M) is given by the general formula XR in which X is selected from 
a group consisting of -OH. -\'H : . -\ T HR. -SH. -NCS. -OCO(CH 2 ) r COOH (where r = ] -20!. 
-NHCO(CH 2 ) r COOH (where r = 1-20). -OS0 2 OH. -OCO(CH 2 )~I (where r = 1-20). and 
-OP(0-Alkyl)N(Alky!) 2 . and R is selected from a group consisting of H. methyl. eth\ l. 
propyl, isopropyl. t-buryl. hexyl. benzyl, benzhydryl. triryl. substituted trityl. aryl. substituted 
aryl, polyoxymethylene. monoalkylated polyoxymethylene. a polyethylene imine. a 
polyamide of the general formula f-NHfCH 2 ) r NHCO(CH 2 ) r CO-) m . a polyamide of the 
general formula (-NH(CH 2 > r CO-) m . a polyester of the general formula (-OfCH 2 ) r CO- , m . an 
alkylated sily] compound of the general formula -Si(Y) 3 . a heterooligo/polyaminoacid of the 
general formula (-NHCHaaCO-) m . a polyethylene glycol of the general formula 
-(CH 2 CH 2 0) m -CH 2 CH 2 OH. and a monoalkylated polyethylene glycol of the eenera! 
formula -(CH 2 CH 2 0) m -CH 2 CH 2 0-Y. where m is in the range of 0 to 200. Y is a lower 
alkyl group selected from a group consisting of methyl, ethyl propyl, isopiopyi. t-buryl. 
hexyl. r is in the range of 1 to 20. and aa represents the amino acid side chain of a narurally- 
occurrine amino acid. 



66. A kit for sequencing nucleic acids by mass spectrometry, comprising: 
a) a solid support having a linking functionality (L'); 
0 b) a set of nucleic acid primers suitable for initiating synthesis of a set of 

complementary nucleic acids which are complementary to the different 
species of nucleic acids, the primers each including a linking group (L ) able 
to interact with the linking functionality (L') and reversibly immobilize the 
primers on the solid support; 

5 c) a set of chain-elongating nucleotides for synthesizing the complementary 

nucleic acids; 

d) a set of chain-terminating nucleotides for terminating synthesis of the 

complementary nucleic acids and generating sets of base-specific terminated 
complementary nucleic acid fragments; and 
3 e) a polymerase for synthesizing the complementary nucleic acids from the 

primers, chain-elongating nucleotides and chain-terminating nucleotides, 
wherein the chain-terminating nucleotides are mass-modified so that addition of 
one species of the chain-terminating nucleotides to the complementary nucleic acid 
can be distinguished by mass spectrometry from addition of all other specie? of 



chain-*? 



iragments are cleaved from the solid support prior tc mass spectrometry 
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68. The method according to claim 32, wherein the base-specifically terminated 
fragments are cleaved from the solid support during mass spectrometry. 

69 The solid support according to claim 36. wherein the photocleavable bond of the 
5 linkage, L-L\ is selected from the group consisting of a charge transfer complex or a stable 
organic radical. 

70. The method according to claim 32, wherein the reversible linkage is a 
photocleavable bond. 

10 

71 . The method according to claim 33. wherein the reversible linkage is a 
photocleavable bond. 

72. The method according to claim 33, wherein the base-specifically terminated 
15 fragments are cleaved from the solid support prioi io mass spectrometry. 



73. The method according to claim 33, wherein the base-specifically terminated 
fragments are cleaved from the solid support during mass spectrometry. 
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